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Abstract 

Self-assembling  materials  have  gained  attention  in  the  field  of  nanotechnology  due  to  their 
potential  to  be  used  as  building  blocks  for  fabricating  complex  nanoscale  devices.  The 
biological  world  is  abundant  with  examples  of  functional  self-assembling  biomolecules. 
Proteins  are  one  such  example,  found  in  a  variety  of  geometries  and  shapes.  This  research  is 
focussed  on  the  use  of  ring-shaped  self-assembling  proteins,  called  Lsm  proteins,  as 
componentary  for  applications  in  bionanotechnology.  Lsm  proteins  were  used  because  of 
their  spontaneous  association  into  stable  rings,  tolerance  to  mutations,  and  affinity  to  RNA. 
This  thesis  primarily  focussed  on  the  thermophilic  Lsma  (from  Methanobacterium. 
thermoautotrophicum )  that  assembles  as  heptameric  rings. 

The  oligomeric  state  of  the  heptameric  protein,  and  hence  the  diameter  of  its  central  cavity, 
was  manipulated  by  judiciously  altering  appropriate  residues  at  the  subunit  interface.  Lsma 
presented  a  complex  set  of  interactions  at  the  interface.  Out  of  the  mutations  introduced, 
R65P  yielded  a  protein  for  which  SEC  and  SAXS  data  were  consistent  with  a  hexameric 
state.  Moreover,  key  residues,  L70  and  171,  were  identified  that  contribute  to  the  stability  of 
the  toroid  structure. 

Covalent  linking  of  rings  provided  nanotubular  structures.  To  achieve  this,  the  surface  of  the 
Lsma  ring  scaffold  was  modified  with  Cys  residues.  This  approach  led  to  the  formation  of 
novel  Lsma  nanotubes  approximately  20  nm  in  length.  Importantly,  the  assembly  could  be 
controlled  by  changing  the  redox  conditions.  As  an  alternative  method  to  manipulate  the 
supramolecular  assembly,  His6-tags  were  attached  at  the  termini  of  the  Lsma  sequence.  The 
higher-order  organisation  of  the  constructs  was  influenced  by  the  position  of  the  His6-tag. 
The  N-terminally  attached  His6-tag  version  of  Lsma  showed  a  metal-dependent  assembly  into 
cage-like  structures,  approximately  9  nm  across.  This  organisation  was  highly  stable, 
reproducible,  and  reversible  in  nature. 

The  results  presented  in  this  thesis  aid  the  understanding  of  generating  complex 
nanostructures  via  in  vitro  self-assembly.  The  Lsma  rings  were  assembled  into  higher-order 
architectures  at  the  quaternary  level  by  employing  protein  engineering  strategies.  Future  work 
is  necessary  to  functionalise  these  supramolecular  structures;  however,  this  study  confirms 
the  potential  role  of  Lsma  proteins  as  a  molecular  building  block  in  bionanotechnology. 
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Chapter  one :  Introduction 


1  Chapter  one 
Introduction 

1 .1  Context 

A  role  has  been  recognised  for  a  number  of  protein  assemblies  in  the  thriving  new  field  of 
bionanotechnology,  where  their  unique  quaternary  geometries  may  be  used  in  a  number  of 
ways;  for  example,  as  nanotubes  or  biocontainers.  This  thesis  examines  a  self-assembling 
homo-oligomeric  ring-shaped  protein  that  is  a  part  of  a  family  of  RNA  chaperones.  The 
protein  was  used  as  a  building  block  to  construct  new  bionanostructures.  The  design  and 
solution-based  structural  studies  have  been  carried  out  using  a  combination  of  experimental 
approaches  such  as  site-directed  mutagenesis,  size  exclusion  chromatography  coupled  static 
light  scattering  and  small  angle  X-ray  scattering.  The  nanostructures  obtained  were  examined 
with  transmission  electron  microscopy. 

This  chapter  provides  a  background  on  self-assembling  protein  systems,  strategies  used  to 
generate  novel  nanostructures,  and  how  these  unique  supramolecular  organisations  may  fit 
into  the  field  of  bionanotechnology. 

1.2  Background 

The  field  of  nanoscience  holds  promise  to  deliver  the  next  generation  of  advanced  technology 
1;  2.  Despite  being  a  broad  field,  it  has  witnessed  tremendous  growth  over  the  past  few  years. 
It  builds  upon  principles  and  expertise  from  diverse  fields  such  as  physics,  chemistry, 
material  science,  biology  and  engineering.  A  sub-branch,  bionanotechnology,  is  a  highly 
interdisciplinary  field  that  aims  at  developing  methods  to  fabricate  nanoscale  structures  with 
biological  and  abiological  components  that  respond  to  their  external  environment 3.  There  are 
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many  examples  of  biological  entities,  such  as  oligonucleotides,  proteins,  viral  capsids  and 
cells,  used  for  fabrication  of  new  biomaterials  4.  Such  biological  components  have  been 
successfully  incorporated  in  biosensors,  bionanoarrays  and  nanoparticle  (NP)  conjugates  5. 
As  more  and  more  characterisation  and  visualisation  techniques  become  available,  we  can 
expect  to  see  an  increase  in  the  variety  of  biomolecules  used  to  generate  functional 
nanostructures. 

Nucleic  acids  such  as  deoxyribonucleic  acid  (DNA)  and  ribonucleic  acid  (RNA)  have  been 
used  extensively  to  create  complex  3D  structures.  This  success  is  largely  attributed  to  their 
simple  Watson-Crick  base  pairing  and  ease  of  chemical  synthesis.  Peptides,  too,  have 
attracted  a  lot  of  attention  for  fabricating  new  nanostructures  6.  Despite  numerous  examples 
of  DNA  and  peptide  structures,  functionalisation  remains  a  big  hurdle  1 .  DNA  structures  are 
also  difficult  to  control  in  three  dimensions  8.  Proteins,  on  the  other  hand,  offer  diversity  in 
three  dimensional  architecture  and  flexibility  towards  modification  by  genetic  engineering 
and  chemical  methods.  They  already  possess  complex  architectures  and  a  variety  of  three 
dimensional  shapes  such  as  rings,  tubes  and  cages  that  can  be  used  as  components  to  create 
nanomachines  9.  Therefore,  proteins  are  increasingly  playing  an  important  role  as  building 
blocks  in  the  field  of  bionanotechnology  and  multicomponent  protein  systems  are 
increasingly  being  used  in  synthetic  biology  10.  They  exhibit  some  very  valuable  properties 
such  as  assembly  into  complex  supramolecular  structures,  interactions  with  specific  ligands 
and  sequence  dependent  function.  The  challenge  is  learning  to  control  these  interactions  in 
vitro  to  engender  useful  functionality. 
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1.3  Bionanotechnology-  an  interface  between  biology  and  physical 
sciences 

Bionanotechnology  combines  knowledge  from  life  sciences  and  physical  sciences  to  fabricate 
responsive  devices  that  can  contribute  to  the  development  of  drug  delivery  systems,  sensors 
and  “smart”  biomaterials.  The  field  of  nanotechnology  has  witnessed  reasonable  progress  in 
fabrication  of  devices  by  lithographic  techniques,  but  to  construct  a  device  at  nanoscale  still 
remains  a  challenge  u.  Thus,  a  biomimetic  approach  towards  designing  a  nanoscale  device  is 
fast  gaining  momentum  where  principles  of  biological  systems  such  as  molecular  self- 
assembly  are  replicated  and  applied  to  synthesise  artificial  nanostructures. 


"Top-down" 


Lithography, 

patterning 


Well-ordered  assembly 


Self-assembly 


"Bottom-up" 


Molecular  recognition 
(electrostatic,  hydrophobic, 
aromatic  interactions,  etc.) 


Simple  building  blocks 


Figure  1.1 1  A  comparison  of  the  “top-down”  and  “bottom-up”  approaches  applied  to 
manufacturing  of  nanomaterials.  The  “bottom-up  ”  approach  is  based  on  the  interactions  of 
simple  building  blocks  which  can  self-assemble  into  well-ordered  complex  structures. 
Schematic  adapted  from  Gazit  (2007)  . 
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1 .3.1  Self-assembly 

Self-assembly  is  the  phenomenon  by  which  molecules  spontaneously  associate  to  form 
ordered  systems  through  a  combination  of  non-covalent  interactions  u.  As  shown  in  Figure 
1.1,  the  relevant  associations  take  place  through  hydrogen  bonding,  van  der  Waals 
interactions,  aromatic  stacking,  and  in  some  cases  metal  coordination,  leading  to  the 
formation  of  stable  assemblies  with  a  defined  shape  and  architecture  13.  Harnessing  these 
interactions  in  a  biomolecular  self-assembling  system  offers  promise  for  the  development  of 
bionanomaterials  with  unique  properties. 

1.3.1. 1  Self-assembly  in  biology 

Biological  systems  abound  with  examples  of  hierarchical  self-assembly.  From  a  school  of 
fish  to  a  bacterial  swarm,  all  these  processes  involve  some  kind  of  distinct  pattern  formation 
and  dynamic  self-assembly  u.  Cellular  processes  such  as  mitosis,  DNA  replication,  protein 
transcription  and  translation  and  protein  signalling  are  facilitated  by  the  phenomenon  of  self- 
assembly,  making  it  one  of  the  most  fundamental  processes  to  support  life  14 .  It  involves  an 
interplay  of  associations  among  molecules  such  as  nucleic  acids,  proteins  and 
oligosaccharides.  Virtually  all  functions  in  a  living  cell  are  carried  out  by  intricate  protein 
machineries  that  assemble  in  vivo  through  transient  interactions.  These  highly  precise  and 
regulated  interactions  give  rise  to  molecular  machines  in  the  cells  such  as  ATP  synthase  and 
ribosomes  that  are  capable  of  performing  highly  sophisticated  functions  15.  Viruses  are 
supramolecules  based  on  assembly  of  protein  modules  and  nucleic  acid,  and  form  different 
shapes  in  a  variety  of  sizes  16 .  The  tobacco  mosaic  virus  (TMV)  self-assembles  into  rod-like 
structures  with  more  than  2000  protein  structural  units  assembling  around  an  RNA  molecule 
to  form  a  300  nm  long  cylinder  17 .  These  nanoscale  cylinders  can  be  reconstituted  if  broken 
into  their  individual  components.  TMV  molecules  represent  an  excellent  example  of  a  robust 
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biological  self-assembled  system.  Such  biological  organisations  serve  as  an  inspiration  and  a 
platform  to  artificially  synthesise  molecular  structures  in  vitro  based  on  the  principles  of 
hierarchical  self-assembly  18. 

1.3. 1.2  In  vitro  self-assembly 

There  are  two  ways  to  obtain  nanostructures.  The  "top-down"  approach  essentially  is 
breaking  down  or  decomposing  large  structural  systems  to  their  constituent  parts.  In 
nanotechnology,  this  involves  using  large  external  devices  to  create  nanoscale  structures  and 
includes  processes  such  as  nanolithography  (Figure  1.1).  "Bottom-up"  approaches  involve 
the  interaction  of  atoms  and  molecules  to  self-assemble  into  large  complex  structures  by 
means  of  various  intermolecular  interactions.  It  is  generally  believed  that  the  "bottom-up" 
approach  is  relatively  energy  efficient  and,  in  principle,  cheaper  for  mass  production  of 
nanoscale  devices  19.  The  study  of  self-assembly  can  undoubtedly  lead  to  the  discovery  of 
intriguing  protein  nanosystems  that  can  have  interesting  applications  in  the  synthesis  of  novel 
biomaterials  and  biosensors  (or  nanobiosensors). 

In  order  to  use  biomolecules  in  conjunction  with  material  science,  the  biological  entities  have 
to  be  manipulated  beyond  their  physiological  parameters.  Lipids,  carbohydrates,  and  peptides 
have  all  been  explored  as  building  blocks  to  create  supramolecular  materials  13.  Significant 
success  has  been  seen  for  peptide  systems  with  a  self-assembling  propensity  which  have  been 
used  in  hydrogel  biomaterials  20 ,  tissue  engineering,  and  3D  cell  culture  21 .  Nucleic  acids 
offer  a  relatively  simple  mechanism  of  interaction  and  have  been  used  for  diagnostic  and 
biosensing  applications  22,  but  lack  the  ability  to  be  functionalised  and  form  a  variety  of 
complex  3D  structures  23. 
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Algorithm-based  design  of  organised  supramolecular  structures  generates  a  large  library  of 
nanomaterials  with  distinct  configurations  and  allows  for  a  large  combination  of  energetically 
favourable  structures  to  be  tested  24.  Huge  strides  have  been  made  by  this  approach  to 
generate  structures  with  atomic  level  precision  25.  Baker  et  al.  have  successfully  generated  a 
variety  of  cage-like  architectures  from  simple  trimeric  protein  building  blocks  by  using  the 
RosettaDesign  algorithm  and  extending  it  for  modelling  multiple  protein  interface 
interactions  .  This  approach  can  give  rise  to  a  variety  of  morphologies  by  using  building 
blocks  of  different  symmetries,  combined  in  geometrically  specific  ways  11 . 

Nearly  every  protein  forms  transient  or  permanent  complexes  with  metal  ions  to  facilitate 

oo 

their  structural  or  functional  role  .  With  new  methods  in  protein  engineering,  it  is  possible 
to  engineer  individual  metal  binding  sites  to  create  a  metal-induced  self-assembly  event.  For 
example,  the  monomeric  component  of  microtubules,  tubulin,  crystallises  as  sheets  on 
coordination  with  Zn2+  29.  In  another  instance,  disulfide  bonds  were  engineered  on  a 
monomeric  protein  in  order  to  mimic  its  quaternary  structure  to  a  cryptand-like  complex  with 
Zn-binding  capacity  30.  Specific  sites  on  the  protein  were  targeted  to  generate  large-ordered 
macromolecular  structures  on  controlled  addition  of  metal  to  the  protein  solution.  The  same 
protein  was  also  induced  to  assemble  as  very  stable  protein  sheets  and  arrays  on  addition  of 
Zn2+  31.  Thus,  with  various  strategies  of  protein  engineering,  either  by  re-engineering  existing 
quaternary  assemblies  or  using  algorithm-based  strategies,  a  variety  of  functional  geometries 
can  be  generated  (Figure  1.2). 
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Figure  1.2\  The  process  of  self-assembly.  (A)  Hierarchical  organisation  of  simple  building 
blocks  into  ordered  functional  assemblies  found  in  vivo.  Adapted  from  Howorka  et  al.  (2007) 

32 

.  (B)  A  schematic  representation  of  common  nanostructures.  Adapted  from  Chopra  et  al. 
(2007)  33. 


1.4  Nucleic  acid  as  building  blocks 

Oligonucleotides  have  been  successfully  employed  in  the  creation  of  nucleic  acid-based 
nanoscale  assemblies  including  3D  nano  objects,  crystalline  arrays  and  nanogrids  34.  They 
can  be  synthesised  rapidly  and  reliably  and  their  3D  shape  can  be  controlled  by  specific 
oligonucleotide  sequence  motifs.  Thus,  the  architecture  of  nucleic  acid-based  nanostructures 
can  be  controlled  with  relative  ease.  DNA  is  a  simple  molecule  with  respect  to  its  constituent 
building  blocks,  the  four  deoxyribonucleotides,  through  which  it  encodes  the  entire  genomic 
information  of  the  cell.  A  single  strand  of  oligonucleotide  has  the  potential  to  fold  into  a 
predictable  and  stable  2D  or  3D  architecture,  making  DNA  an  excellent  molecule  for 
construction  of  nanoscale  structures.  Two-dimensional  structures  were  created  from  single 
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stranded  DNA,  using  a  four-arm  junction  as  the  basic  structural  unit.  These  2D  objects  were 

Q /T 

further  reconfigured  to  yield  three-dimensional  structures  and  curved  objects  .  In  another 
example,  fragments  of  nucleic  acids  were  used  as  modular  building  blocks  to  fabricate  robust 
cuboid  structures  with  interior  cavities  and  tunnels  37 . 

Focus  has  now  shifted  towards  investigating  RNA  as  an  equivalent  building  block  for 
nanostructures.  Recent  studies  show  that  by  incorporating  a  K-tum  motif  in  RNA  that  allows 
it  to  bend  at  angle  of  60°,  the  geometry  of  the  molecule  can  be  controlled.  An  equilateral 
triangle  shaped  RNA  molecule  was  successfully  synthesised  that  was  stabilised  at  the  corners 
by  an  RNA  binding  protein  (L7Ae)  (Figure  1.3).  The  repertoire  of  such  ribonucleoprotein 
(RNP)  structures  can  be  increased  by  incorporating  and  experimenting  with  many  other 
characterised  RNP  motifs  to  generate  novel  nanoarchitectures  35 .  Strategies  to  functionalise 
these  structures  are  beginning  to  emerge,  but  are  at  present  limited  to  DNAzymes  and 
ribozymes  38.  Due  to  the  paucity  of  chemical  groups  available,  functionalisation  of 
nucleotide-based  structures  is  a  challenge,  limiting  their  use  as  a  molecular  tecton. 


Tn-RNP-l 


•  35 

Figure  1.3 1  RNA-based  nanostructure.  An  RNP  equilateral  triangle  . 
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1.5  Proteins  and  peptides  as  functional  building  blocks 

Proteins  and  peptides  are  appealing  for  the  synthesis  of  nanoscale  devices  because  of  their 
versatility  in  function  and  variety  in  their  3D  structures.  They  can  be  made  cheaply  and 
quickly  either  by  synthetic  means  or  recombinant  methods.  Peptide  sequences  with  certain 
motifs  exhibit  self-assembling  properties  and  associate  to  form  sophisticated  structures. 
Extensive  work  has  been  carried  out  with  coiled-coil  motifs  39,  amyloid-like  structures 
comprised  of  P-strands  and  peptide  amphiphiles  40.  Peptide  supramolecular  chemistry  has 
witnessed  immense  progress  over  the  years.  Self-assembling  peptides  have  been  explored  to 
create  complex  nanoarchitectures  like  nanoribbons,  nanospheres,  and  nanotubes  (Figure 
1.2B).  Peptide-based  nanofibres  have  been  applied  in  3D  cell  culture  and  tissue  engineering 
41 .  Although  peptides  have  a  more  modular  and  robust  nature,  they  lack  the  chemical 
complexity  and  hence,  functionality,  exhibited  by  proteins  and  are  comparatively  more 
expensive  to  synthesise  at  a  large  scale. 

Proteins  are  formed  by  a  modular  approach,  wherein  subunits  interact  and  assemble  into  well 
defined  supramolecular  architecture,  through  both  permanent  and  transient  bonds  42, 43.  Thus, 
these  biomolecules  offer  an  inherent  complexity  and  are  capable  of  performing  a  fleet  of 
functions  in  biology.  The  quaternary  structures  formed  by  self-assembling  proteins  have  the 
potential  to  be  designed  and  manipulated  to  form  structures  that  can  be  useful  in 
nanotechnology.  However,  the  principles  of  protein  folding  are  still  not  fully  understood, 
which  makes  predictable  protein  design  difficult  44.  As  mentioned  in  Section  1.3.1.2, 
algorithm-based  protein  design  as  an  in  vitro  self-assembly  strategy  is  now  beginning  to 
witness  routine  success.  Multi-component  protein  nanomaterials  were  generated  using  such  a 
strategy  45 . 
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The  level  of  structural  complexity  increases  with  an  increase  in  the  number  of  components 
involved.  The  number  of  interactions  that  drive  the  assembly  offer  various  levels  of  control 
over  the  process,  thereby  providing  precision  and  directionality  to  construct  nanostructures. 
There  are  some  proteins  which  exhibit  excellent  self-assembling  properties  and  can  take  the 
shape  of  cages,  rings  and  tubes. 

The  first  protein  architecture  to  be  studied  as  a  nanostructure  was  a  cage  protein.  Ferritin  is  a 
24  subunit  ubiquitous  intracellular  protein  that  stores  iron  in  a  microcrystalline  non-toxic 
form  and  releases  it  in  a  controlled  manner  46.  Small  molecules  such  as  metal  ions  can  diffuse 
into  the  interior  cavity  via  acidic  channels  and  accumulate,  resulting  in  the  formation  of  NPs. 
The  dimensions  of  the  NPs  are  controlled  by  the  size  of  the  cage  structure.  Such  a  protein 
cage  architecture  was  used  as  a  ‘container’  for  synthesis  of  cobalt  oxide  minerals  (C03O4  and 
Co(0)OH  )  and  iron  oxide  47 . 

The  bacterial  tRNA  attenuation  protein  (TRAP)  assembles  as  an  11-mer  ring  with  a  pore 
diameter  of  approximately  2.0  nm  48.  It  is  involved  in  the  regulation  of  the  trp  operon 
encoding  for  enzymes  involved  in  tryptophan  synthesis  in  bacteria  49 .  This  protein  is  heat- 
stable  and  reasonably  tolerant  to  mutations,  making  it  a  very  convenient  protein  to  work  with 
in  vitro.  Heddle  et  al.  constructed  a  modified  protein  ring  by  linking  three  to  four  monomers 
which  resulted  in  the  formation  of  a  toroid  assembly  with  12  subunits  rather  than  an  11-mer 
as  found  naturally  50.  The  authors  further  created  nanodot  binding  modules  by  replacing  an 
unconserved  arginine  (Arg)  residue  in  the  inner  pore  with  a  cysteine  (Cys)  residue,  thus 
artificially  introducing  a  thiol  moiety  in  the  pore  of  protein.  With  only  one  Cys  residue 
present,  the  authors  were  able  to  bind  gold  NPs  in  the  inner  pore.  This  protein  was  further 
functionalised  by  attaching  a  titanium-binding  peptide  (TBP)  on  the  termini,  thereby 
immobilising  these  protein  rings  on  an  inorganic  surface  51  (Figure  1.4A).  With  further 
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modification  to  the  protein  ring,  this  group  also  demonstrated  that  TRAP  protein  can 
assemble  as  nanotubes  approximately  1  pm  in  length  through  disulfide  and  hydrophobic 
interactions  52  (Figure  1.4B). 

Stable  protein  1  (SP1)  is  a  highly  thermostable  protease  resistant  homo-oligomer  that  forms  a 
dodecamer,  first  isolated  from  Aspen  populus  53 .  In  a  series  of  experiments,  SP1  was  shown 
to  be  a  potential  candidate  for  use  as  a  building  block  to  generate  nanostructures.  The  enzyme 
glucose  oxidase  (GOx)  was  linked  to  the  SP1  subunit  that  subsequently  assembled  as  a 
nanotube  with  GOx  protruding  out 54.  It  was  further  shown  to  form  regular  arrays  and  interact 
with  gold  NPs,  successfully  forming  gold  nanowires  55.  More  recently,  a  gold  NP  conjugate 
of  SP1  linked  to  GOx  enzyme  was  layered  on  an  electrode  surface  by  means  of  di thiol 
bridging  units  56.  The  SPl-NPs  hybrid  was  also  successfully  used  as  a  component  of  a  basic 
logic  circuit 57 . 

Another  type  of  protein  architecture  with  growing  interest  is  protein  tubes,  as  they  can  form 
containers  for  controlled  drug  release  and  also  have  applications  in  electronics.  58;  59:  60.  A 
recent  example  of  such  application  is  HcpI  protein  from  Pseudomonas  aeruginosa.  HcpI  is  a 
small  17.4  kDa  protein  that  is  a  secreted  component  of  the  type  IV  secretion  system.  It  adopts 
a  homo-hexameric  ring-shaped  structure  with  an  outer  diameter  of  9  nm  and  inner  diameter 
of  4.0  nm  61 .  By  introducing  Cys  residues  on  the  top  and  bottom  surface  of  the  ring,  the 
protein  nanotube  was  stabilised  by  disulfide  bonds  (Figure  1.4C).  The  tube  was  capped  with 
HcpI  muteins  that  had  an  attached  epitope,  thus  creating  an  enclosed  elongated  cavity  which 
could  function  as  a  nanocapsule  .  Biophysical  characterisation  of  the  HcpI  protein  showed 
that  its  quaternary  structure  can  be  assembled  and  disassembled  without  disrupting  the 
secondary  structure  by  using  detergents,  thereby  providing  a  control  over  its  assembly 
process.  This  can  be  further  exploited  for  fabrication  of  nanocontainers  . 
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Figure  1.4\  (A)  Crystal  structure  of  TRAP  protein  showing  the  C-terminal  region  in  blue  and 
Arg  residue  in  yellow  51 .  (B)  TEM  of  mutant  TRAP  protein  tubes  in  presence  of  a  weak 


reducing  agent 52 .  (C)  Hcpl  protein  genetically  modified  to  assemble  into  tubules  62 . 
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Well-ordered  arrays  can  also  be  formed  by  capturing  inorganic  materials  into  the  central 
cavity  of  such  protein  rings.  An  example  illustrating  this  is  HSP60  protein  from  Sulfolobus 
shibatae,  in  which  the  chaperone  proteins  form  a  barrel-shaped  structure.  By  utilising  the 
reactive  thiol  group  of  the  incorporated  Cys  residues,  gold  NPs  as  well  as  semiconductor 
materials  such  as  CdSe-ZnS  were  attached  to  the  central  cavity  64 . 

Another  strategy  used  to  alter  the  functions  of  self-assembled  protein  structures  is  to 
incorporate  peptides  that  have  different  binding  capabilities  towards  inorganic  materials. 
There  have  been  extensive  studies  on  peptides  that  bind  to  metals  such  as  gold  65  and  silver  66 . 
Filamentous  temperature  sensitive  protein  Z  (FtsZ)  served  as  a  template  for  fabrication  of 
inorganic  materials.  By  incorporating  peptides  with  established  affinities  towards  gold,  silver 
and  nickel  ions,  protein  hybrid/inorganic  materials  were  synthesised  . 

NPs  can  also  be  used  as  a  template  for  assembly  of  protein  architectures.  Proteins  with  a 
His6-tag  bind  to  Au-NPs  derivitised  with  nitrilotriacetic  acid  (NTA),  thereby  assembling  as 
monomers,  dimers,  trimers,  and  spherical  shells  that  can  be  controlled  by  varying  the  NPs 
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size  .  In  another  example,  the  assembly  TMV  coat  protein  was  controlled  by  attachment  of 
a  His6-tag  which  caused  it  to  assemble  into  disks,  tubes,  arrays  and  fibres  17 . 

From  the  examples  of  protein  tectons  mentioned  above,  the  criteria  required  for  a  successful 
building  block  is  spontaneous  assembly  into  its  quaternary  form,  a  robust  nature,  stability 
over  a  range  of  buffer  conditions,  and  potential  for  functionalisation.  Although  a  plethora  of 
proteins  are  available  to  be  studied,  their  inherent  complexity  brings  with  it  a  new  level  of 
difficulty  to  manipulate  them  into  useful  architectures.  Hence,  the  use  of  proteins  as  tectons  is 
not  a  trivial  task.  This  thesis  focuses  on  one  such  protein  system,  Lsm  (like-Sm)  proteins  that 
show  various  levels  of  self-assembly,  exhibit  oligomeric  plasticity  and  have  thermostable 
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versions  available.  These  features  are  essential  to  adapt  Lsm  proteins  as  tectons  for  further 
downstream  applications  in  nanotechnology. 

1.6  Lsm  proteins 

Lsm  proteins  derive  their  name  from  the  previously  characterised  Sm  family  of  proteins 
because  of  their  structural  similarity  69 .  Sm  proteins  are  a  group  of  seven  polypeptides  that  are 
found  associated  with  several  small  nuclear  RNAs  (snRNA).  These  form  heteroheptameric 
complexes  on  binding  to  a  uracil-rich  consensus  region  of  snRNAs  .  These 
ribonucleoprotein  complexes  are  involved  in  mRNA  splicing  and  have  functions  in  post- 
transcriptional  mRNA  modification  in  eukaryotes  71 . 

In  eukaryotes,  Lsm  proteins  assemble  as  heteroheptameric  assemblies  and  assist  in  the  RNA 
processing  events  in  the  nucleus  and  cytoplasm.  Similar  to  Sm  proteins,  Lsm  complexes  were 
named  Lsml  to  Lsm8.  Out  of  these  polypeptides,  Lsm2  to  Lsm8  were  identified  to  be 
associated  with  U6  snRNA  in  the  nucleus  and  found  to  be  necessary  for  stability  of  the 
complex  .  On  the  other  hand,  Lsm  1-7  is  a  cytoplasmic  heteroheptameric  complex  which  is 
involved  in  mRNA  degradation  events.  Lsm  proteins  have  also  been  shown  to  be  involved  in 
processing  of  tRNAs,  small  nucleolar  RNAs  (snoRNAs)  and  pre-rRNAs,  basically  acting  as  a 
chaperone  protein  that  facilitates  efficient  association  of  pre-tRNA  with  their  substrates  73 . 
Figure  1.5  shows  the  location  of  Lsm  proteins  in  the  cell  and  the  various  complexes  formed 
by  Lsm  polypeptides  in  association  with  ribonucleoproteins  (RNPs). 

Lsm  proteins  are  also  present  in  Archaea,  forming  homomeric  complexes  74.  It  is  thought  that 
archaeal  Lsm  proteins  represent  a  primitive  form  of  Sm  proteins  of  eukaryotic  snRNPs.  These 
protein  groups  may  have  arisen  through  gene  duplication  events  followed  by  acquiring  their 
distinct  function  and  cellular  location  75 .  It  is  inferred  that  Lsm  homologues  in  Archaea  bind 
to  small  RNA  and  tRNA  .  Proteins  with  structural  similarity  to  Sm  proteins  are  also  found 
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in  Bacteria.  The  first  protein  to  be  identified  was  Hfq  in  Escherichia  coli,  which  forms  a 

77 

homo-hexameric  ring  complex  .  It  was  found  to  be  involved  in  a  number  of  cellular 
processes  including  the  regulation  of  translocation  of  mRNA  and  its  degradation  78. 
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(B)  Schematic  of  heteroheptamers  of  Lsm  and  Sm  found 
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in  yeast  and  vertebrate  cells 

Depictions  of  JJ 8  snoRNA-associated  complex  and  snR5  snoRNP  where  the  complex  could  be 
a  hexamer  80 .  U7  snRNP  Sm  core  depicting  interaction  of  Sm  and  Lsm  monomers  81 .  Image 
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adaped  from  Wilusz  et  al  (2005)  . 
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1.7  Sequence  and  structure  of  Lsm  monomers 

The  Lsm  polypeptide  sequence  stretches  for  approximately  80  amino  acids  with  the 
molecular  weight  of  proteins  varying  from  8-25  kDa.  The  sequences  are  depicted  in  Figure 
1.6,  within  each,  a  bipartite  consensus  sequence  (designated  Sml  and  Sm2  motifs)  can  be 
identified.  These  motifs  arise  from  strands  pi-p3  and  P4-P5  of  the  core  P-sheet  structure, 
respectively.  A  variable  stretch  of  residues  between  these  conserved  segments  is  created  by  a 
surface-exposed  interconnecting  loop  82.  The  common  feature  of  the  Lsm  monomer  is  five 
strongly  bent  anti-parallel  P-strands  capped  by  a  short  N-terminal  a-helix  83.  Out  of  the  five 
P-strands,  the  first  and  fifth  are  shorter  in  length  as  compared  to  the  second,  third  and  fourth, 
which  are  longer  and  curved.  The  Sml  motif  is  comprised  of  an  N-terminal  a-helix  and  pi-, 
P2-  and  P3-  strands  followed  by  a  variable  loop  sequence,  whereas  the  Sm2  motif  consists  of 
P4-  and  P5-  strands  84.  The  core  of  the  Lsm  monomer  is  hydrophobic  in  nature  and  in  order  to 
bury  this  core,  the  Lsm  monomers  interact  at  the  p4/p5  interface  of  adjacent  subunits,  thereby 
forming  a  closed  ring-shape  architecture  (Figure  1.7C).  The  interface  between  the  subunits 
involves  P-sheet  extension  hydrogen  bonding,  with  reciprocal  interaction  of  P4  of  one  subunit 
with  P5  of  the  neighbouring  subunit.  The  resulting  assembly  is  a  highly  stable  ring  structure 
with  one  face  consisting  of  N-terminal  a-helices  and  the  opposite  face  containing  the  variable 
loop4  region  (Figure  1.7B). 

The  Lsm  P-sheet  is  a  highly  curved  feature  of  the  Lsm  fold,  making  the  monomers  nearly 
elliptical  or  U-shaped  in  cross-section.  The  polypeptide  backbone  can  adopt  this  curvature 
due  to  specific  Glycine  (Gly)  residues  serving  as  pivot  points  to  support  the  shape  of  the  P- 
sheet.  A  characteristic  feature  of  the  monomer  structure  is  the  variation  in  loop4.  This  loop 
links  P3-  and  p4-strands  and  varies  from  just  a  few  residues  in  bacterial  and  archaeal 


16 


Chapter  one :  Introduction 


Mt  Lsma 
Pae  Sml 
Af  Sml 
Af  Sm2 
Pa  Sml 


a-helix  LI  pi  L2  p2  L3  p3 


10 


20 


30 


40 


50 


RV — NVQR-PL-D-ALGNSL — NSPVI IKLKG-DREFRGVLKSFDLHMNLVLNDAEELE-D 

- NVQR-PL-D-ALGNSL — NSPVI IKLKG-DREFRGVLKSFDLHMNLVLNDAEELE-D 

- PR-PL-D-VLNRSL--KSPVIVRLKG-GREFRGTLDG|DIHMNLVLLDAEEIQ-N 

- GAMVL-PN-Q-MV|SMV--GKIIRVEMKGEENQLVGKLEG|DDYMNLYLTNAMECK-G 

- ER-PL-D-VIHRSL--DKDVLVILKK-GFEFRGRLIG1DIHLNVVLADAEMIQ-D 


L4 


P4  L5  p5 


60  70  80 

I  I  I 

GfeVTRRLGTVLIRGDNIVYISP- 

GlEVTRRLGTVLIRGDNIVYISR- 

gevvrkvgsvvirgdtIvfvsp- 

EEKVRSLGEIVLRGNNVVLIQP- 

GEVVKRYGKIVIRGDNVLAISP- 


■A 

■Q 

■T 


Sp  Lsm  - DS-SPNE-FLNKVI  —  GKKVLIR|SS-GVDYKGILSCLDGYXNLALERTEEYVN - GkKTNVYGDAF I RGNNVL YVS A - L 

Sc  Lsm3  — hhmet-pl-d-llklnl--dervyiklrg-artlvgtlqaIdshcnivlsdavetiyqlnne-elseser-rcemvfirgdt|tlist--psavei 

Sp  Lsm4  - GRPILVELKN-GETFNGHLENCDNYXNLTLREVIRTXPD - GDKFFRLPECYIRGNNIKYLRI - 


Sa  Hf q 
Ec  Hf q 
An  Hf q 
Mj  Hf  q 
Bs  Hf q 


MI — ANEN-IQ-DKALENFKANQTEVTVFFLN-GFQMKGVIEEYDKY-VVSLNS- 
- GQS-LQDP-FLNALRRERVPVSIYLVN-GIKLQGQIESFDQF-VILLKN- 

- sl-psir-qlqnlikqaapveiklvt-gdaitgrvlwIdpt-c|ciad- 

- PN-FE — yarrln--gkkvkiflrn-gevldaevtg|sny-eimvkv- 

- n-Iq-DqflnqIRKEnT|yvtvflln-gfqlrgqvkgJdnf-t&'lleS' 


QGKQHLIYKHAISTYTV - E 

- TV S  QMVYKHAI S  T VVP  —  SRPVSH 
--RQTTIWKQAIAYLQP - K 

gdInllvfkhaidyiey - 


EGKQQLIYKHAISTFAPQKNVQLEL 


Cph  Lsm 
Pae  Sm3 


- KNIKIXRLVT-GEDIIGNI-SESQG-LITIKKAFVII-PQPVQLVLSPWQPYTDDK - EIVI 

-FV-A-ELNNLL - GREVQVVLSN-GEVYKGVLHAVDNQLNIVLANASNK - AGEKFN-RVFI 


DDSKIVITITS 

-YRYIVHIDS 


Sml  Sm2 

Figure  1.6\  Structure-based  alignment  of  Lsm  protein  sequences.  Lsm  proteins  are  from  the  following  species  M.  thermoautotrophicum  (Mt), 
A.  julgidus  (Af),  Pa  (P.  abyssi),  P.  aerophilum  (Pae),  S.  cerevisiae  (Sc),  S.  pombe  (Sp),  S.  aureus  (Sa),  E.  coli  (Ec),  Anabena  sp.  (An),  M. 
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janaschii  (Mj),  B.  subtilis  (Bs)  cyanophage  (Cph).  Secondary  structure  assignment  is  based  on  the  crystal  structure  of  Mt  Lsma  . 
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homologs  to  several  residues  in  eukaryotic  homologs.  The  orientation  of  the  residues  towards 
the  outside  on  loop4  make  them  prominent  structural  features  on  the  distal  face  or  loop  face 
of  the  ring  scaffold  (Figure  1.7B).  The  amino  acid  variation  also  modulates  the  RNA-binding 
properties  of  the  Lsm  ring.  These  features  make  loop4  a  strong  target  for  engineering.  Thus, 
the  Lsm  cyclic  oligomer  provides  four  surfaces  i.e.  helix  face,  loop  face,  central  pore,  and 
edge  of  ring,  that  can  be  altered  to  influence  its  higher  order  organisation. 


Figure  1.71  The  Lsm  fold.  Ribbon  structure  of  Lsma  from  Methanobacterium 
thermoautotrophicum  (Mt  Lsma,  PDB  ID  1181 )  74  showing  the  self-assembled  heptameric 
ring  structure.  (A)  Top  view.  (B)  Side  view.  (C)  Interaction  of  f -strands  from  two  Mt  Lsma 
subunits  within  the  heptameric  complex,  Chain  A  is  represented  in  cyan  and  Chain  B  in  blue. 
Residues  involved  in  hydrophobic  packing  at  the  dimer  interface  ( Chain  A:  lle27,  Val77, 
Tyr78  of  chain  A;  Chain  B:  Leu  30,  Phe36,  Leu66,  Val69,  Ile71 )  are  shown  in  stick 
representation. 
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1.8  Oligomeric  plasticity  of  Lsm  rings 

Several  Lsm  proteins  have  been  crystallised,  providing  high-resolution  views  of  the  ring 
morphology  of  the  assembled  molecules.  As  shown  in  Figure  1.8,  the  Lsm  rings  range  from 
6  nm  to  8  nm  in  outer  diameter  and  contain  a  central  cavity  of  0.6  nm  to  1.5  nm.  Since  the 
inherent  function  of  Lsm  protein  is  to  bind  RNA,  some  crystal  structures  have  been  solved  in 
the  presence  of  the  nucleic  acid,  which  reveals  the  precise  residues  involved  in  RNA  binding. 
This  is  an  added  advantage  for  functionalising  the  molecular  tecton  with  nucleic  acid  binding 
properties. 

Despite  the  conservation  in  the  amino  acid  sequence  and  the  three  dimensional  structures, 
Lsm  proteins  exhibit  a  high  degree  of  oligomeric  plasticity.  The  majority  of  crystal  structures 
of  Lsm  are  hexameric  and  heptameric  protein  assemblies,  although  the  recent  discovery  of 
pentameric  and  octameric  forms,  have  broadened  the  spectrum  of  quaternary  structures  of 
Lsm  proteins.  While  these  forms  are  prepared  through  recombinant  means,  it  does  raise  the 
possibility  of  such  oligomers  existing  in  vivo  as  fully  functional  forms,  further  suggesting  the 
presence  of  a  variety  of  multimeric  forms.  Studies  have  shown  that  Lsm  assemblies  may  be 
relatively  dynamic  in  solution,  providing  capacity  to  engage  in  alternative  protein 

O/T 

partnerships  and  other  stable  groupings  . 

Although  the  physicochemical  and  stereochemical  cause  of  this  plasticity  is  unknown,  it 
makes  the  Lsm  system  an  interesting  candidate  for  use  as  a  molecular  building  block  in 
nanotechnology.  Its  series  of  oligomeric  states  (n  =  5, 6, 7, 8...)  provides  various  levels  at 
which  the  molecule  can  be  tuned.  For  example,  to  obtain  different  pore  sizes,  the  inter¬ 
subunit  interface  can  be  engineered  to  potentially  alter  the  oligomeric  state  of  the  ring 
scaffold.  Thus,  the  oligomeric  plasticity  is  an  attractive  quality  of  Lsm  proteins  for  targeting 
it  as  a  modular  building  block. 
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Figure  1.8 1  Selected  crystal  structures  solved  for  Lsm  assemblies.  (A)  Trimer  adopted  by 
Lsm4  fragment,  S.  pombe  (PDB  ID  4EMH).  (B)  Pentamer,  cynanophage  (PDB  3BY7)  6.0  nm 
ring,  0.9  nm  pore.  (C)  Hexamer,  E.  coli  Hfq  ( PDB  1HK9)  6.5  nm  ring,  1.0  nm  pore.  (D) 
Heptamer,  M.  thermoautotrophicum  Lsma  (PDB  1181 )  6.5  nm  ring,  1.5  nm  pore.  (E) 
Octamer,  S.  cerevisiae  Lsm3  (PDB  3BW1 )  7.5  nm  ring,  1.5  nm  pore. 
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1 .9  Higher  order  structures 

Lsm  proteins  also  form  higher  order  oligomers  assembling  either  as  a  14-mer  (in  the  case  of 
the  heptamer)  or  16-mer  (in  the  case  of  the  octamer)  by  stacking  two  rings  together.  They 
may  interact  between  helical  faces  or  between  loop  to  loop  stack  or  by  helix  to  loop  stacking 
87: 88.  Certain  Lsm  proteins  also  show  fibrogenesis  in  the  absence  of  RNA  89.  Although  both 
bacterial  and  archaeal  Lsm  have  been  shown  to  form  fibril  structures,  biophysical 
characterisation  indicates  different  hierarchical  assembly  of  the  two.  The  fibre  models  for 
E.  coli  Hfq  indicate  contacts  between  the  hexameric  protein  rings  mediating  formation  of  a 
protofilament 89  in  a  multilayer  arrangement  where  each  layer  is  made  up  of  six  hexameric 
rings.  Alternatively,  the  crystal  structure  of  Mt  Lsma  reveals  a  tubular  arrangement  of  homo- 
heptamers  via  head  to  tail  stacking  90  in  a  directional  manner.  Figure  1.9  shows  the  various 
quaternary  structures  and  higher  order  organisations  adopted  by  different  Lsm  proteins.  The 
exact  biological  role  of  these  polymers,  if  any,  remains  elusive.  However,  they  are  ideally 
suited  as  self-assembling  proteins  for  bionanotechnology. 

1.10  Yeast  Lsm  protein  -  Lsm3 

Lsm  proteins  found  in  yeast  systems  assemble  as  heteromeric  complexes.  Although  recent 
discoveries  of  homomeric  complexes  in  the  case  of  yeast  SmF,  Lsm3  and  Lsm5  from 
Cerevisiae  parvum,  indicate  that  other  oligomeric  organisations  of  Lsm  proteins  are  possible 
in  non-native  conditions.  The  Lsm  proteins  in  yeast  exist  as  complexes  of  heteroheptamers. 
However,  subunit  three  alone,  Lsm3,  assembles  in  vitro  as  a  stable  ring  of  eight  units  87. 
Lsm3  exhibits  many  typical  features  of  Lsm  proteins,  but  there  are  marked  changes  in  the 
geometry  of  the  p4/p5  interface  between  two  subunits,  which  is  believed  to  play  a  vital  role  in 
determining  the  organisation  of  the  Lsm3  ring  structure  87.  This  relationship  between  the  sub- 
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Figure  1.9\  Higher  order  quaternary  structures  adopted  by  Lsm  proteins.  (A)  Face-to-face 
stacking  seen  in  Lsm3  (PDB  ID  3BW1)  crystal  .  (B)  Head-to-tail  stacking  observed  in  Mt 
Lsma  crystal  packing  (PDB  ID  1181).  (C)  Transmission  EMs  of  E.  coli  Hfq  showing  bundles 
of  unbranched  fibres  91 .  (D)  TEM  showing  polymerisation  Mt  Lsma  into  polar  fibrils  89 .  (E) 
Well  ordered  bundles  formed  by  Pae  Lsm  91 .  (F)  Crystal  lattice  of  Lsm3  showing  interaction 
of  the  rings. 
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unit  interface  angle  and  the  number  of  subunits  forming  the  ring  has  also  been  highlighted  by 
the  crystal  structure  of  the  Lsm5/6/7  complex  of  Shizosaccharomyces  pombe  12 . 

The  crystal  structure  of  Lsm3  shows  the  presence  of  coaxial  stacks  in  which  the  helical  faces 
of  two  rings  interact  by  hydrogen  bonds  and  salt  bridges,  giving  rise  to  a  16-mer. 
Furthermore,  the  C-terminus  of  Lsm3  interacts  with  P-strands  of  other  rings,  as  seen  from  the 
crystal  structure,  giving  rise  to  arrays  87 .  The  C-terminal  residues  of  Lsm3  extend  as  an 
additional  stretch  of  P-strand  within  the  Lsm3  subunits,  which  runs  antiparallel  to  the  twisted 
P3/L4/P4  ribbon  from  a  neighbouring  octamer,  resulting  in  the  formation  of  hydrogen  bonds 
with  residues  of  the  P3 -strand.  This  insertion  results  in  formation  of  an  array  in  the  crystal 
lattice  of  Lsm3. 

1.11  Archaeal  Lsm  proteins 

Sm  homologs  were  discovered  in  Archaea  through  sequence  analysis.  Although  no  definite 
biological  role  is  assigned  to  archaeal  Lsm  proteins,  they  have  been  shown  to  interact  with 
RNA  molecules  in  vitro ,  implicating  a  biological  role  in  RNA  metabolism  and  mediating 
RNA  protein  interactions  ’  .  Archaeal  Lsm  proteins  do  not  possess  an  elongated  N- 

terminus,  C-terminus  or  loop  region.  Sml  and  Sm2  motifs  essentially  occupy  most  of  the 
sequence,  as  opposed  to  other  eukaryotic  Lsm  proteins  that  have  extended  end  terminal 
residues  74:  83 .  An  atypical  structure  has  been  reported  in  Pyrobaculum  aerophilum,  in  which 
the  Sm  domain  extends  up  to  60  residues  at  the  C-terminus  and  it  assembles  as  a  14-mer  88. 
Mt  Lsma,  from  thermophile  M.  thermoautotrophicum,  consists  of  an  81  amino  acid 
polypeptide  sequence,  with  each  monomer  consisting  of  a  highly  hydrophobic  core  due  to  the 
highly  curved  nature  of  P-sheets.  It  self-assembles  as  a  toroid  with  seven  identical  subunits 
having  a  central  pore  diameter  of  1.0- 1.5  nm  74.  Mt  Lsma  does  contain  an  unusually  long  N- 
terminal  sequence  when  compared  with  other  archaeal  Lsm  proteins  with  the  a-helix  starting 
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at  residue  13.  These  extended  N-terminal  residues  are  not  essential  for  ring  formation  74, 
making  this  region  an  ideal  site  for  further  modification  by  genetic  or  chemical  means.  For 
the  purpose  of  this  thesis,  Mt  Lsma  will  be  referred  to  as  Lsma  in  subsequent  sections. 

1.12  Significance  and  potential  outcomes 

The  field  of  bionanotechnology  is  a  thriving  new  area  of  research,  fast  creating  a  niche  in  life 
sciences  and  nanotechnology.  A  bottom-up  approach  for  fabrication  of  nanodevices  offers 
more  control  and  precision  on  the  nanometre  scale.  Producing  complex  nanoscale  devices 
presents  many  challenges,  as  the  components  of  such  devices  require  special  properties. 
Biomolecules  offer  unique  features  and  advantages  from  the  assembly  standpoint.  With  self¬ 
assembling  biological  systems,  it  is  now  possible  to  create  elaborate  patterns  and  arrays  on 
surfaces  with  better  control  over  the  dimension  and  position  of  the  components  .  To 
customise  the  fabrication  of  molecular  biomaterials,  it  is  necessary  to  understand  the  process 
of  self-assembly  of  the  biomolecular  units  into  their  higher  order  states.  By  choosing  a 
protein  system  as  a  module  for  the  development  of  molecular  tectons,  this  thesis  aims  to 
provide  a  new  protein  tecton,  as  well  as  useful  insight  into  the  assembly  process  and  present 
strategies  to  control  the  supramolecular  organisation  of  these  protein  nanocomponents. 

1.13  Scope  of  the  thesis 

Lsm  proteins  readily  form  stable  ring  structures  with  specific  binding  capacity  for  RNA.  With 
different  oligomeric  states  present,  available  crystal  structures,  and  thermostable  versions  to 
choose  from,  Lsm  proteins  could  be  customised  into  new  componentary  suitable  for 
fabrication  of  new  biomaterial.  Therefore,  Lsm  proteins  were  chosen  as  potential  candidates 
to  fabricate  novel  biological  nanostructures,  with  its  readily  accessible  sites  for  addition  of 
functional  moieties  and  potential  to  tune  the  pore  size. 
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In  this  project,  Lsm  proteins  have  been  investigated  as  building  blocks  to  create  novel 
nanostructures.  The  study  focused  primarily  on  a  thermophilic  protein  from  M. 
thermoautotrophicum,  which  assembles  as  a  heptamer.  The  Lsm  proteins  were  well  studied  in 
biology  due  to  their  involvement  in  a  plethora  of  RNA  processing  events  in  the  cell.  Atomic 
level  resolution  was  provided  by  the  crystal  structures  to  elucidate  the  precise  structural 
features  of  Lsm  proteins.  Although  Sm/Lsm  proteins  have  been  studied  for  their  biochemical 
and  evolutionary  structural  properties,  their  characteristic  ring-shaped  architecture  lends  itself 
to  the  fabrication  of  novel  nanostructures  that,  to  date,  had  not  been  explored. 

The  specific  aims  of  the  research  experiments  were: 

i.  To  examine  the  behaviour  of  Lsm3  and  Lsma  and  determine  the  system  with 
properties  better  suited  for  use  as  a  tecton. 

ii.  To  carry  out  site-directed  mutagenesis  to  explore  the  interface  between  the  monomers 
of  the  Lsma  ring  module. 

iii.  To  engineer  the  cyclic  oligomers  into  higher  order  organisations  and  determine  their 
structure  by  physical  methods. 

iv.  To  characterise  the  nature  of  the  quaternary  structure  of  the  discrete  species 
engineered  and  the  interface  muteins  formed  in  solution. 
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1.14  Thesis  overview 

Chapter  2  outlines  the  experimental  procedures  used  throughout  this  body  of  work. 

Chapter  3  details  the  design  and  strategies  used  for  investigation  of  Lsma  subunit  interface.  It 
discusses  the  mutations  generated  and  the  potential  change  they  induced  in  the  oligomeric 
state  of  Lsma.  The  chapter  also  discusses  the  trial  of  Lsm3  protein  from  a  eukaryotic  source. 

Chapter  4  presents  the  strategy  to  induce  higher  order  forms  of  the  circular  oligomer  as 
nanotubes  and  discusses  methods  to  control  its  organisation  to  achieve  a  new  hierarchical 
level  of  assembly. 

Chapter  5  focuses  on  the  surface  modification  of  the  Lsma  scaffold,  detailing  the  biophysical 
characterisation  of  the  variants  generated.  It  entails  the  investigation  of  the  Lsma 
supramolecular  structure  and  its  suitability  to  be  used  as  a  molecular  tecton. 

Chapter  6  summarises  the  major  findings  of  this  thesis,  and  future  directions  to  be  pursued 
from  this  work. 
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2  Chapter  two 
Materials  and  methods 

2.1  Introduction 

This  chapter  begins  with  an  overview  of  the  materials  used  in  the  experiments.  A  wide 
variety  of  techniques  from  varied  disciplines  have  been  used  ranging  from  general 
microbiology  to  small  angle  X-ray  scattering  at  the  Australian  Synchrotron.  The  laboratory 
texts  authored  by  Maniatis  et  al.  1  were  invaluable  for  routine  manipulation  of  DNA  plasmids 
and  bacterial  strains.  G  E  Healthcare  prescribed  protein  purification  techniques  were  used  as 
a  guide  to  design  the  protein  purification  protocols.  Also  used  were  the  previous  methods 
developed  in  the  Protein  Structure  lab,  Macquarie  University,  Sydney. 

2.2  Materials  and  equipment 
2.2.1  Reagents 

Unless  otherwise  stated,  chemicals  were  purchased  from  Aldrich  Chemicals  or  Sigma 
Chemical  Company  Ltd.  Media  for  growing  bacterial  cultures  were  purchased  from  Life 
Technologies  (Christchurch,  New  Zealand).  General  reagents  used  are  summarised  in  Table 
2.1.  Purified  water  from  a  MilliQ  system  (Millipore)  was  used  throughout.  Sodium  dodecyl 
sulfate  polyacrylamide  (SDS-PAGE)  gels  and  protein  ladders  were  purchased  from  Life 
Technologies.  Column  chromatography  media  were  purchased  from  G  E  Healthcare  Life 
Sciences.  The  centrifuges  used  were  Eppendorf  5810R  (fixed  angle  rotor  F-34-6-38)  and 
Sorvall  RC  6  plus  (rotor  F10S6x500Y). 
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Table  2.11  Reagents  used  in  this  work  listed  with  commercial  sources. 


Item 

Supplier 

acetic  acid,  glacial 

Ajax  Finechem 

agar  (bacteriological) 

Life  Technologies 

ampicillin 

Life  Technologies 

ammonium  sulfate 

Astral 

Benchmark  protein  ladder 

Life  Technologies 

bromophenol  blue 

Progen 

calcium  chloride 

BDH 

chloramphenicol 

BDH 

Coomassie  brilliant  blue 

BDH 

dimethylformamide 

Astral 

dithiothreitol 

BDH 

DNase  I 

Astral 

glucose 

Astral 

glycerol 

AppliChem 

4-(2-hydroxyethyl)- 1  - 
piperazineethanesulfonic  acid 

Astral 

hydrochloric  acid 

Ajax  Finechem 

imidazole 

AppliChem 

isopropyl-P-D-thiogalactopyranoside  (IPTG) 

Astral 

lactose 

Merck 

lysozyme 

Astral 

magnesium  sulfate 

AppliChem 

potassium  chloride 

BDH 

potassium  phosphate 

Astral 

protease  inhibitor  cocktail 

Sigma-Aldrich 

RNase  A 

Astral 
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sodium  acetate 

Ajax  Finechem 

sodium  chloride 

Ajax  Finechem 

sodium  dodecyl  sulfate 

AppliChem 

sodium  hydroxide  pellets 

BDH 

sodium  phosphate 

Astral 

tetramethylethylenediamine 

Bio-Rad 

tris(hydroxymethyl)aminomethane 

Astral 

bacto-tryptone 

Oxoid 

urea 

Merck 

yeast  extract 

Life  Technologies 

2.2.2  Growth  media  and  buffers 

Luria-Bertani  (LB)  -  To  1  L  dH20  was  added  bacto-tryptone  (10  g),  yeast  extract  (5  g)  and 
NaCl  (10  g)  and  sterilised  by  autoclaving.  LB  plates  were  prepared  by  using  15  g/L  agar. 

Autoinduction  media  -  Bacto-tryptone  (10  g)  and  yeast  extract  (5  g)  was  added  to  925  mL 
dHoO.  The  media  were  sterilised  by  autoclave  following  which  MgS04  (1  M,  1  mL), 
50x5052  media  (20  mL)  and  20xNPS  (50  mL)  were  added  (all  autoclaved  separately  prior  to 
addition).  [50x5052  media  -  glycerol  (250  g),  glucose  (25  g),  a-lactose  (100  g)  in  730  mL 
dH20] ;  [20xNPS  pH  6.75  -  (NH4)2S04  (66  g),  KH2P04  (136  g),  Na2HP04  (142  g)  added  to 
900  mL  ddH20]. 

SOC  media  -  Bacto-tryptone  (20  g),  yeast  extract  (5  g),  NaCl  (0.58  g),  KC1  (0.18  g),  MgCl2 
(0.95  g),  MgS04  (1.20  g)  were  added  to  1  L  dH20.  20%  w/v  glucose  was  added  after 
autoclaving. 
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Table  2.21  Composition  of  growth  media. 


Buffer 

LB  medium 
ZY  medium 
50x5052 
20xNPS 
SOC  medium 


Composition 

bacto-tryptone  (10  g/L),  yeast  extract  (5  g/L),  NaCl  (10  g/L) 

bacto-tryptone  (10  g/L),  yeast  extract  (5  g/L) 

glycerol  (250  g),  glucose  (25  g),  a-lactose  (100  g) 

(NH4)2S04  (66  g/L),  KH2P04  (136  g/L),  Na2HP04  (142  g/L) 

bacto-tryptone  (20  mg/mL),  yeast  extract  (5  mg/mL),  NaCl  (0.58  g/L), 
KC1  (0.95  g/L),  MgCl2  (1.20  g/L),  MgSC>4  (1.20  g/L),  glucose  (3.6  g/L) 


Table  2.31  Buffers  used  at  various  stages  of  protein  purification. 

Process 

Buffer 

Composition 

4-> 

Buffer  A 

20  mM  Tris/HCl,  pH  8.0,  200  mM  NaCl,  2%  glycerol. 

J-l-H 

Buffer  B 

20  mM  Tris/HCl,  pH  8.0,  200  mM  NaCl,  2%  glycerol,  10  mM 
imidazole. 

§  Sb 

s § 

<D  S 
.2  § 

Buffer  C 

20  mM  Tris/HCl,  pH  8.0,  200  mM  NaCl,  2%  glycerol,  500  mM 
imidazole. 

£  4b 
o  u 

a 

a 

HH 

lysis  buffer 

20  mM  Tris/HCl,  pH  8.0,  200  mM  NaCl,  2%  glycerol,  DNase  (10 
pg/mL),  RNase  (100  pg/mL),  lysozyme  (1  mg/mL),  protease 
inhibitor  cocktail. 

o 

U 


T3 

<d 

bO 

bO 

cb 


H 


c/3 

a 


Buffer  D 

PBS,  pH  7.3  (140  mM  NaCl,  2.7  mM  KC1,  10  mM  Na2HP04, 
mM  KH2P04,  pH  7.3). 

Buffer  E 

50  mM  Tris/HCl,  10  mM  reduced  glutathione,  pH  8.0. 

Buffer  F 

PBS,  pH  7.3  (140  mM  NaCl,  2.7  mM  KC1,  10  mM  Na2HP04, 
mM  KH2P04,  pH  7.3),  1  mM  TCEP. 

Buffer  G 

PBS,  pH  8  (140  mM  NaCl,  2.7  mM  KC1,  10  mM  Na2HP04, 
mM  KH2P04,  pH  7.3). 

Wash  buffer  1 

PBS,  pH  7.3,  0.1  %  Triton  X-100. 

Wash  buffer  2 

PBS,  pH  7.3,  1  M  NaCl. 
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2.2.3  Expression  hosts  and  plasmids 

The  bacterial  strains  trialled  and  used  in  this  work  have  been  listed  in  Table  2.4.  Hfq- 
deficient  strain,  (named  MRE5)  was  a  gift  to  the  Macquarie  University  Protein  Structure 
Group  from  Jean  Beggs  (University  of  Edinburgh). 


Table  2.41  Bacterial  strains  used  to  generate  protein  material. 


E.  coli  strain 

Use 

Source 

BL21 

expression  host 

Life  Technologies 

BL21  (DE3) 

expression  host 

Life  Technologies 

BL21  (DE3)  pLysS 

expression  host 

Macquarie  University 

TOPIO 

plasmid  isolation 

Life  Technologies 

MRE5Ahfq 

expression  host 

Jean  Beggs 

Antibiotics  used  in  the  media  were  filter  sterilised  and  stored  as  lOx  stock  solutions  at  -20  °C. 

The  final  working  concentrations  (Table  2.5)  were  used  for  the  selection  of  an  appropriate 

bacterial  strain. 

Table  2.51  List  of  antibiotics  used  as  selection  markers  for  the  bacterial  strains. 

Antibiotic 

Stock  (mg/mL) 

Working  (pg/mL) 

Ampicillin 

100 

100 

Chloramphenicol 

30 

30 

Kanamycin 

50 

50 

The  yeast  Lsm3  gene  was  cloned  in  vector  pCL774  by  Nishen  Naidoo  2,  Protein  Structure 


Group,  Macquarie  University,  and  transformed  in  BL21(DE3)  pLysS  strain  of  E.  coli.  For  the 
purpose  of  this  study,  the  material  was  obtained  as  a  bacterial  glycerol  stock  and  stored  at  -80 
°C.  The  Lsma  full  length  gene  was  synthesised  by  Epoch  Biolabs  (Sugar  Land,  Texas,  USA). 
The  genes  were  cloned  into  pET24a  and  pGEX-4T-2  vectors,  containing  a  His6-tag  and  GST- 
tag,  respectively.  The  full  length  gene  and  the  other  sequences  used  to  generate  Lsma 
constructs  are  shown  in  Table  2.6. 
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Table  2.61  Sequences  of  Lsm  proteins  used  for  this  work. 


Gene 

Number 
of  amino 
acids 

Sequence 

yeast  Lsm3 

96 

MHHHHHHMETPLDLLKLNLDERVYIKLRGARTLVGTLQAFDSHC 

NIVLSDAVETIYQLNNEELSESERRCEMVFIRGDTV 

TLISTPSEDDDGAVEI 

Lsma 

83 

MSVIDVSSQRVNVQRPLDALGNSLNSPVIIKLKGDR 

EFRGVLKSFDLHMNLVLNDAEELEDGEVTRRLGTVLIRGDNIVY 

ISP 

His6-  Lsma 

99 

MH6ENLYFQGGSMS VI DVS  SQRVNVQRPL DAL GNSLNSPVIIKL 
KGDREFRGVLKSFDLHMNLVLNDAEELEDGEVTRRLGTVLIRGD 

NIVYISP 

GST 

cleaved 

Lsma 

85 

GSMSVIDVSSQRVNVQRPLDALGNSLNSPVIIKLKGDR 

EFRGVLKSFDLHMNLVLNDAEELEDGEVTRRLGTVLIRGDNIVY 

ISP 

2.2.4  Chromatography  equipment,  media  and  columns 

Chromatography  operations  were  performed  on  an  AKTA  Explorer  system  (G  E  Healthcare). 
Hi-Trap  Chelating  columns  (1  mL,  G  E  Healthcare)  and  HisTrap  columns  (1  mL,  G  E 
Healthcare)  were  used  for  immobilised  metal  affinity  chromatography  (IMAC)  precharged 
with  different  metal  ions  (Section  2.5.2).  For  analytical  and  preparative  size  exclusion 
chromatography,  Superdex  200  GL  10/300  (G  E  Healthcare)  columns  (24  mL)  and  HiLoad 
16/600  Superdex  200  pg  (G  E  Healthcare)  columns  (120  mL)  were  employed,  respectively. 

2.3  Molecular  biology 
2.3.1  Storage  of  bacterial  strains 

Glycerol  stocks  were  prepared  in  order  to  store  E.  coli  strains  indefinitely.  The  cells  were 
grown  in  10  mL  of  LB  media  containing  the  appropriate  antibiotics  for  16  h  at  37  °C.  0.5  mL 
of  resulting  cell  suspension  was  mixed  with  0.5  mL  of  sterilised  50%  glycerol.  The  cells  were 
snap  frozen  in  liquid  nitrogen  and  stored  at  -80  °C. 
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2.3.2  Plasmid  isolation  from  E.  coli 

Lsma  expression  plasmids  and  pRARE  constructs  were  isolated  from  bacterial  cells  using  a 

Q 

commercial  kit  (QIAprep  Spin  Miniprep  kit,  QIAGEN)  according  to  the  manufacturer’s 
instructions.  The  plasmids  were  collected  in  the  supplied  elution  buffer  and  stored  at  -20  °C 
in  20  pL  aliquots. 

2.3.3  Competent  cell  preparation 

A  modified  Inoue  method  4  was  used  to  prepare  competent  cells  for  this  work.  A  5  mL  starter 
culture  (5  mL  LB  with  20  mM  MgSCL)  was  grown  overnight  with  shaking  at  180  rpm  at  37 
°C.  This  starter  culture  was  used  to  inoculate  250  mL  of  LB  broth  containing  20  mM  MgSCL 
in  a  2  L  conical  flask.  The  culture  was  incubated  with  shaking  at  180  rpm  at  26  °C  for  4  h 
until  the  optical  density  at  600  nm  (OD600)  reached  0.5.  After  cooling  on  ice  for  10  min,  the 
cells  were  gently  centrifuged  at  4000  rpm,  10  min,  4  °C.  The  supernatant  was  discarded  and 
the  cells  resuspended  in  ice-cold  TB  buffer  (80  mL,  containing  250  mM  KCI2,  15  mM  CaCl2, 
10  mM  PIPES  pH  6.7,  55  mM  MnCh)  and  centrifuged  as  before  and  the  supernatant 
removed.  The  cells  were  resuspended  in  ice  cold  TB  buffer  (20  mL)  for  a  second  time  with 
addition  of  1.5  mL  DMSO  followed  by  incubation  on  ice  for  10  min.  Aliquots  of  50  pL  were 
made  and  were  used  immediately  or  snap  frozen  in  liquid  nitrogen  before  storing  at  -80  °C. 

2.3.4  Bacterial  transformation  for  plasmid  propagation 

Plasmid  DNA  was  propagated  and  maintained  in  commercially  available  E.  coli  TOP  10  cells 
(Life  Technologies).  2  pL  of  plasmid  DNA  was  added  to  a  thawed  50  pL  aliquot  of 
competent  cells  and  incubated  on  ice  for  30  min.  The  cells  were  subjected  to  heat  shock  in  a 
digital  heat  block  at  42  °C  for  precisely  35  seconds  after  which  they  were  immediately 
transferred  to  ice.  250  pL  of  room  temperature  SOC  media  was  added  and  incubated  at  37  °C 
for  60  min  with  shaking  at  140  rpm.  50  pL  of  the  culture  was  spread  on  a  pre- warmed  LB 
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agar  plate  containing  appropriate  antibiotics  and  incubated  at  37  °C  for  12-16  h.  The  colonies 
from  the  resulting  cultures  were  used  to  prepare  glycerol  stocks  or  for  subsequent  plasmid 
preparation  (Section  2.3.2). 

2.4  Protein  expression 

2.4.1  Transformation  for  protein  expression 

pET24a  and  pGEX-4T-2  vectors  containing  the  Lsma  gene  was  co-transformed  with  pRARE 
plasmid  into  a  commercially  available  BL21(DE3)  strain.  Aliquots  (2  pL)  of  Lsma 
expression  plasmids  and  pRARE  were  added  to  the  thawed  bacterial  cells  and  were 
transformed  as  described  in  Section  2.3.4.  The  glycerol  stocks  of  the  expression  strains  were 
made  (Section  2.3.1)  and  used  to  inoculate  expression  cultures. 

2.4.2  Small-scale  expression  for  screening  of  transformed  colonies 

To  determine  the  optimum  incubation  time  and  temperature  for  protein  expression,  small 
scale  expression  trials  were  carried  out.  Two  methods  of  induction  were  trialled  using 
isopropyl  P-D-l-thiogalactopyranoside  (IPTG)  and  auto-induction  allowing  the  expression  of 
the  recombinant  proteins  5. 

2.4.2.1  IPTG  induced  protein  expression 

Colonies  from  LB  agar  plates  were  used  to  inoculate  LB  media  (10  mL)  containing 
appropriate  antibiotics.  Suspension  cultures  were  grown  in  Falcon  tubes  (50  mL)  to  an  OD60o 
of  0.4-0.6  before  IPTG  was  added  (0.2  mM  or  1  mM).  To  assess  the  effect  of  temperature  on 
protein  expression,  the  cultures  were  subsequently  incubated  at  26  °C  or  37  °C,  with  shaking 
at  180  rpm.  500  pL  samples  were  taken  at  various  time  points  and  centrifuged  (12,000  rpm,  5 
min)  and  the  supernatant  removed.  80  pL  of  the  SDS-PAGE  buffer  was  added  to  the  pellets 
which  were  subsequently  boiled  for  5  min  to  lyse  the  cells.  10  pL  was  loaded  onto  an  SDS- 
PAGE  gel  (Section  2.6.3). 
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2.4.2.2  Protein  expression  by  auto-induction 

In  a  similar  way  to  the  IPTG  induction  protocol,  colonies  from  LB  agar  plates  were  used  to 
inoculate  auto-induction  media  (10  mL)  containing  antibiotics  and  grown  at  26  °C  and  37  °C 
with  shaking  at  180  rpm.  Samples  were  collected  as  described  above. 

2.4.3  Large-scale  expression  of  protein 

For  the  production  of  recombinant  proteins,  cells  were  streaked  on  LB  agar  plates  containing 
appropriate  antibiotics  and  incubated  overnight  at  37  °C.  A  single  colony  was  used  to 
inoculate  10  mL  LB  broth  (with  antibiotics)  and  incubated  overnight  (37  °C,  180  rpm)  as  a 
starter  culture. 

2.4.3. 1  IPTG  induced  protein  expression 

The  starter  culture  was  used  to  inoculate  200-400  mL  of  LB  media  in  a  2  L  baffled  flask. 
Following  incubation  at  37  °C  with  shaking  at  180  rpm  until  the  ODgoo  reached  0.4-0.6, 
protein  expression  was  induced  with  0.2  mM  IPTG.  The  culture  was  transferred  to  26  °C  and 
cells  grown  overnight  with  shaking  at  180  rpm.  Cells  were  further  harvested  by  centrifugation 
at  4000  rpm  for  10  min  and  re-suspended  in  20  mL  of  buffer  B.  The  cell  suspension  was  used 
directly  to  purify  the  recombinant  protein  or  else  snap  frozen  in  liquid  nitrogen  and  stored  at 
-80  °C. 

2.4.3.2  Protein  expression  by  auto-induction 

A  starter  culture  was  used  to  inoculate  500  mL  of  ZY-rich  media  (Table  2.2)  which  was  then 
incubated  at  26  °C  with  shaking  at  180  rpm  for  22  h.  Cells  were  further  harvested  by 
centrifugation  at  4000  rpm  for  10  min  and  re-suspended  in  20  mL  of  buffer  B.  The  cell 
suspension  was  used  directly  to  purify  the  recombinant  protein  or  else  snap  frozen  in  liquid 
nitrogen  and  stored  at  -80  °C. 
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2.5  Protein  purification 

2.5.1  Protein  extraction 

2. 5. 1.1  Sonication 

Sonication  uses  a  small  metal  probe  oscillating  at  ultrasonic  frequency  generating  localised 
low  pressure  and  membrane  disruption  through  cavitation.  Lysis  was  carried  out  using  a 
Sonicator  3000  (Misonix)  equipped  with  a  Microprobe  (Misonix)  for  volumes  less  than  5  mL, 
and  a  standard  probe  for  larger  volumes  (30  mL).  The  cell  suspension  was  prepared  in  a  50 
mL  plastic  beaker  and  placed  in  an  ice  slurry  to  prevent  excessive  heating  of  the  sample.  The 
cells  were  sonicated  at  60%  amplitude  with  a  pulse  rate  of  3  s  on  and  10  s  off  for  a  total  of  20 
min. 

2.5. 1.2  Higher  pressure  homogenisation 

Cell  lysis  by  homogenisation  is  achieved  when  a  sample  is  passed  through  a  needle  valve  at 
high  pressure  causing  cell  disruption  by  shear  stresses  and  decompression  as  cells  return  to 
normal  atmospheric  pressure.  Volumes  larger  than  30  mL  were  lysed  by  passing  the  sample 
through  a  Microfluidics  M-110P  cell  disruptor  at  17,000  psi  at  4  °C.  The  cycle  was  repeated 
three  times  to  ensure  complete  lysis. 

The  cell  debris  from  the  lysed  samples  was  cleared  by  centrifugation  at  15,000  rpm  for  25 
min  at  4  °C.  The  supernatant  contained  the  soluble  recombinant  protein,  which  was  subjected 
to  downstream  chromatography  steps  for  purification. 
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2.5.2  Immobilised  affinity  chromatography  for  HiS6-tagged  proteins  (IMAC) 

IMAC  was  used  to  purify  the  His6-tagged  proteins  using  the  pre-packed  column  mentioned  in 
Section  2.2.4,  The  clarified  lysate  was  filtered  through  a  0.22  pm  syringe  filter  and  applied  to 
a  1  mL  Hi-Trap  Chelating  column  pre-equilibrated  in  buffer  B  [10  column  volumes  (CV)] 
using  a  Gilson  peristaltic  pump.  Where  appropriate,  a  HisTrap  (G  E  Healthcare)  column  was 
used  with  the  same  protocol.  The  column  was  washed  with  10  CV  of  buffer  C  to  remove  any 
unbound  protein.  Pure  protein  was  eluted  with  a  gradient  set  to  100%  buffer  C  on  an  AKTA 
Explorer  system  (G  E  Healthcare)  and  the  eluted  fractions  stored  at  4  °C  or  snap  frozen  and 
stored  at  -80  °C  for  later  use.  All  flow  rates  were  maintained  at  1  mL/min.  To  optimise  the 
affinity  chromatography,  different  transition  metals  were  trialled  to  check  the  maximum 
affinity  of  the  proteins  for  purification.  The  Hi-Trap  Chelating  column  was  charged  with  100 
mM  C0SO4,  100  mM  ZnS04,  100  mM  NiS04  and  100  mM  CUSO4  for  each  trial.  The  sample 
fractions  were  assessed  by  SDS-PAGE  (Section  2.6.3). 


Binding  |  Washing  |  Elution 


Figure  2.1\  Typical  trace  of  immobilised  metal  affinity  chromatography.  The  binding, 
washing  and  elution  steps  are  shown  above  the  graph.  Absorbance  at  280  nm  (mAU)  is 
shown  in  black  and  concentration  of  imidazole  is  in  red. 
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2.5.3  Purification  of  GST-fusion  proteins 

Batch  purification  was  employed  to  purify  the  GST-tagged  proteins.  The  binding  of  GST- 
tagged  proteins  depends  on  the  size,  conformation  and  concentration  of  the  protein  in  the 
sample  being  loaded.  Binding  of  GST  to  glutathione  is  also  flow  dependent,  and  lower  flow 
rates  often  increase  the  binding  capacity.  Therefore,  the  increased  incubation  time  in  a  batch 
purification  mode  increases  the  binding  of  the  recombinant  protein  to  the  matrix  for  efficient 
separation  from  other  soluble  protein  in  the  cell  lysate. 

Glutathione  Sepharose  4B  (G  E  Healthcare)  was  prepared  by  centrifugation  at  600  rpm  for  2 
min  and  the  storage  buffer  removed  by  decanting  the  supernatant.  The  resin  was  equilibrated 
by  re-suspending  the  pelleted  resin  in  buffer  D,  re-pelleting  by  centrifugation  at  600  rpm  for  2 
min  and  removing  the  supernatant.  This  process  was  repeated  thrice  before  adding  buffer  G 
to  the  pelleted  resin.  Approximately  2  mL  of  the  prepared  slurry  was  incubated  with  soluble 
lysate,  extracted  from  400  mL  of  bacterial  culture  (Section  2.5.1),  for  90  min  at  room 
temperature  with  gentle  shaking.  The  slurry  was  then  centrifuged  at  600  rpm  for  5  min  to 
pellet  the  resin  with  bound  protein.  The  supernatant  was  discarded  and  the  pelleted  resin  was 
gently  transferred  to  a  PD- 10  gravity  column  (G  E  Healthcare).  Unbound  protein  was 
removed  by  washing  with  wash  buffer  1  and  wash  buffer  2  (5  CV).  The  slurry  was 
equilibrated  with  buffer  E  before  adding  80  pL  of  thrombin  (1  U)  for  16  h  at  room 
temperature.  The  cleaved  protein  was  eluted  in  buffer  D  (3  CV).  Generally  the  eluted  proteins 
were  further  purified  by  preparative  SEC  (Section  2.5.4)  using  buffer  G.  The  scheme  of 
purification  using  glutathione  Sepharose  4B  is  shown  in  Figure  2.2,  with  on-column  cleaving 
of  the  GST-tag  using  thrombin. 
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Figure  2.21  Schematic  of  affinity  purification  using  GST-tag.  The  GST  fused  recombinant 
protein  was  bound  to  the  glutathione  matrix  and  was  digested  by  thrombin  yielding  pure 
protein  after  elution.  This  purification  scheme  was  utilised  for  the  purification  of  Lsma  and 
its  muteins. 
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2.5.4  Preparative  size  exclusion  chromatography  (SEC) 

All  the  Lsm  protein  samples  were  further  subjected  to  preparative  SEC  in  order  study  the 
oligomeric  distribution  of  the  preparations.  Lsm  samples  (~3  mL)  were  applied  to  a  HiLoad 
16/600  Superdex  200  pg  column  in  appropriate  buffer  at  a  flow  rate  of  1  mL/min.  The  eluted 
protein  fractions  were  collected  as  0.5  mL  fractions  and  relevant  fractions  were  pooled. 

2.6  Protein  analysis 

2.6.1  Protein  concentration 

Purified  protein  samples  were  analysed  by  UV  absorption  at  280  nm  on  the  NanoDrop  2000. 
Extinction  coefficients  of  the  protein  samples  were  calculated  from  amino  acid  sequence 
using  the  ExPasy  online  bioinformatics  server  6  and  were  used  to  determine  their 
concentration  using  the  relation: 

Concentration  =  *  1 

Where  A28o  is  the  absorption  at  280  nm  in  AU,  8280  is  the  extinction  coefficient  at  280  nm 
(M1  cm'1),  and  1  is  the  path  length  (1  mm  in  case  of  NanoDrop  2000). 

2.6.2  Buffer  exchange 

(i)  Buffer  exchange  by  dialysis 

Protein  samples  were  transferred  into  dialysis  tubing  (10,000  MWCO),  and  dialysed  against 
the  required  buffer.  Sample  to  dialysis  solution  ratios  would  typically  be  1:2000.  Dialysis  was 
carried  out  at  4°C  with  stirring  to  increase  the  rate  of  exchange.  Typically,  this  would  be 
repeated  to  achieve  complete  exchange. 
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(ii)  Buffer  exchange  by  chromatography 

5  mL  HiTrap  Desalting  columns  (G  E  Healthcare)  were  purchased  pre-packed  with  Sephadex 
G-25  Superfine  cross-linked  dextran.  These  employ  the  same  principles  as  gel  filtration  to 
separate  molecules  with  a  mass  larger  than  5  kDa  from  those  with  a  mass  below  1  kDa, 
thereby  separating  proteins  from  buffer  salts.  The  column  was  connected  to  the  AKTA 
Explorer  system,  and  equilibrated  with  3  column  volumes  of  the  required  buffer  at  2  mL/min. 
The  sample  was  loaded  onto  the  column  at  a  maximum  volume  of  1  mL.  Protein  was  eluted 
at  2  mL/min  using  the  required  buffer,  0.5  mL  fractions  collected  and  the  elution  of  protein 
monitored  by  UV  absorbance  at  280  nm. 

(iii)  Buffer  exchange  by  dialysis  devices 

For  faster  buffer  exchange  of  smaller  volumes  of  samples  (<  1  mL),  Slide- A-Lyzer  Dialysis 
Devices  (Thermo  Fisher  Scientific)  were  used  with  a  7000  MWCO.  100-500  pL  of  the 
protein  sample  was  pipetted  into  the  dialysis  cup  partially  submerged  in  the  appropriate 
buffer.  Dialysis  was  carried  out  at  room  temperature  or  4°C,  with  stirring  to  increase  the  rate 
of  exchange. 

2.6.3  Sodium  dodecyl  sulfate  polyacrylamide  gel  electrophoresis  (SDS-PAGE) 

SDS-PAGE  is  the  most  common  technique  to  check  the  purity  of  protein  samples.  It 
completely  denatures  proteins  in  the  presence  of  an  ionic  detergent,  SDS,  breaking  the 
hydrogen  bonds  and  coating  the  entire  length  of  protein  giving  it  a  negative  charge.  This 
makes  the  migration  of  protein  through  the  gel  uniform.  In  the  presence  of  reducing  agents, 
dithiothreitol  (DTT)  or  P-mercaptopethanol,  the  disulfide  bonds  are  completely  disrupted, 
thus  completely  denaturing  the  protein  molecule.  The  samples  were  normally  heated  to  70  - 
90  °C  for  5  min  before  loading  onto  the  gel.  For  SDS-PAGE,  3  pL  of  Novex®  Sharp  protein 
ladder  (Life  Technologies)  was  used  as  a  molecular  weight  marker.  NuPAGE®  4-12  %  Bis- 
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Tris  gels  were  run  according  to  manufacturer’s  recommendations  in  a  NuPAGE®  gel 
electrophoresis  box  at  room  temperature.  The  protein  samples  were  mixed  with  reducing 
agent  and  4xlithium  salt  of  dodecyl  sulfate  (LDS)  sample  buffer  or  2xTris-Gly  sample  buffer 
depending  upon  the  nature  of  gel  to  be  run.  The  samples  were  typically  boiled  for  5  min  when 
dealing  with  thermostable  protein.  The  gels  were  stained  with  Simply  blue  stain  using  a  three 
step  protocol.  The  gels  were  placed  in  distilled  water  after  electrophoresis  and  microwaved 
for  20  s  to  remove  the  SDS  present  in  the  running  buffer.  It  was  placed  in  stain  for  1  h,  until 
protein  bands  appeared.  To  destain  the  gel,  it  was  placed  in  distilled  water  for  1  h. 


Table  2.71  Buffer  composition  used  in  gel  electrophoresis. 


Solution 

MES  running  buffer 
LDS  loading  dye 


Reducing  agent 
Tris-Gly  loading  dye 

Simply  Blue  stain 


Content 

50  mM  Tris-MES,  pH  7.6,  0.1%  SDS,  1  mM  EDTA. 

564  mM  Tris  base,  424  mM  Tris/HCl,  40%  glycerol,  8%  LDS, 
2.04  mM  EDTA,  0.88  mM  Coomassie  Blue  G-250,  0.7  mM 
phenol  red,  pH  8.5. 

1  M  P-mercaptoethanol. 

126  mM  Tris/HCl  pH  6.8,  20%  glycerol,  0.005%  bromophenol 
blue. 

80  mg  Coomassie  Brilliant  Blue  (CBB)  dissolved  in  1  L  MilliQ 
water  with  35  mM  HC1. 
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2.6.4  Analytical  size  exclusion  chromatography 

Size  exclusion  chromatography  (SEC)  separates  macromolecules  according  to  their 
hydrodynamic  volume  which  is  defined  by  the  Stokes  radius  7 .  The  column  consists  of  porous 
polymer  beads  designed  to  have  pores  of  different  sizes.  The  particles  with  smaller 
hydrodynamic  volumes  equilibrate  into  these  pores  when  a  mobile  phase  is  passed  through 
the  column.  Therefore,  smaller  particles  would  have  a  longer  path  length  than  larger  particles 
and  would  be  separated  along  the  length  of  the  column  as  their  flow  would  be  retarded  8. 

Although  a  low  resolution  technique,  SEC  is  frequently  used  as  a  final  step  in  protein 
purification  and  for  discerning  between  high  and  low  molecular  weight  components.  It  is  an 
important  means  to  assess  the  homogeneity  of  the  sample  as  aggregated  material  can  be 
easily  separated  from  a  folded  protein.  Most  relevant  to  the  present  piece  of  work,  is  the 
application  of  SEC  to  determine  the  oligomeric  state  of  complexes  as  they  can  be  resolved  in 
native  conditions  in  which  the  macromolecular  interactions  are  conserved  9:  10.  Additional 
advantages  of  SEC  are  its  easy  usage  and  no  requirement  for  specialised  equipment. 

SEC  has  been  successfully  used  for  estimation  of  protein  molecular  weights,  which  involves 
plotting  curves  relating  molecular  weights  and  gel  filtration  behaviour  of  known  standard 
globular  proteins  n.  An  empirical  method  of  plotting  the  log  of  molecular  weight  (logMw) 
against  the  partition  coefficient,  Kav,  gives  a  linear  graph  over  the  middle  part  of  the  working 
range  of  the  column  (Figure  2.3B).  Comparison  of  an  unknown  protein  elution  volume  with 
this  linear  plot  gives  an  estimate  of  the  molecular  weights  12 . 
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Figure  2.3 1  Calibration  of  Superdex  200  10/300  GL  size  exclusion  column  using  BioRad  Gel 
filtration  standards.  (A)  Fractionation  of  a  mixture  of  protein  standards  in  a  calibration  run 
for  molecular  weight  estimation  on  Superdex  200  GL  10/300  at  0.5  mL/min  in  20  mM  Tris, 
pH  8.0,  150  mM  NaCl,  2%  glycerol.  The  peaks  on  the  chromatograph  correspond  to  A- 
thyroglobulin  (670  kDa),  B-y-globulin  (158  kDa),  C-ovalbumin  (44  kDa),  D-myoglobin  (17 
kDa)  and  E-vitamin  Bn  (1.3  kDa).  (B)  Linear  fit  of  Kav  of  the  standards  versus  the  log  of 
molecular  weights  (logMw). 
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2.6.4. 1  Calibration  curves 

For  analytical  SEC,  200  pL  aliquots  of  proteins  were  injected  at  a  flow  rate  of  0.4  mL/min, 
on  a  Superdex  200  GL  10/300  (G  E  Healthcare)  column  (24  mL)  that  was  pre-equilibrated  in 
the  appropriate  buffer.  The  optimum  separation  range  of  the  chosen  matrix  was  10-600  kDa 
13.  In  order  to  estimate  the  molecular  weight  of  the  separated  fractions,  Kav  values  were 
calibrated  using  a  Gel  Filtration  Calibration  kit  (BioRad).  Calibration  was  performed  in  two 
subsequent  runs.  In  the  first  run,  0.2  mL  of  2mg/mL  Blue  Dextran  2000  (2000  kDa)  was 
injected  to  determine  the  void  volume  of  the  column.  Following  the  determination  of  the  void 
volume,  150  pL  of  the  BioRad  calibration  standard  was  injected  as  per  the  manufacturer’s 
instructions.  The  calibration  standard  consisted  of  a  mixture  of  thyroglobulin  (670  kDa),  y- 
globulin  (158  kDa),  ovalbumin  (44  kDa),  myoglobin  (17  kDa)  and  vitamin  Bi2  (1.3  kDa). 
The  log  of  molecular  weight  was  plotted  over  the  Kav  values  of  standard  proteins.  The  values 
were  calculated  from: 


=  Ve-V o  (2-D 

Vc-V0 


Where  Ve  is  the  elution  volume,  V0  the  void  volume  of  column  (7.9  mL)  and  Vc  is  the 
column  volume  (24  mL).  The  molecular  weights  of  proteins  of  unknown  mass  were 
approximated  from  a  linear  fit  to  this  calibration  curve.  Figure  2.3B  shows  the  resulting 
calibration  curve.  All  absorbance  for  the  protein  samples  were  measured  at  wavelength  280 
nm  in  mAU. 
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2.6.5  SEC  in  line  with  static  light  scattering  (SEC-SLS) 

Static  light  scattering  (SLS)  is  a  technique  whereby  an  absolute  molecular  mass  of  a  protein 
sample  in  solution  may  be  experimentally  determined  through  exposure  to  a  low  intensity 
laser  light.  The  intensity  of  the  scattered  light  is  measured  as  a  function  of  angle  and  the  data 
are  analysed  to  calculate  the  molecular  mass.  Along  with  quality  control  of  samples,  SLS  is  a 
precise  method  to  determine  the  oligomeric  state  of  a  protein  sample.  Since  the  light 
scattering  and  concentration  are  measured  for  each  of  the  eluting  fractions,  the  mass  and  size 
can  be  determined  independently  of  the  elution  position.  This  is  particularly  important  for 
novel  protein  architectures  which  may  elute  at  positions  distant  from  that  predicted  by  the 
calibration  for  the  SEC  column. 

The  weight-average  molecular  weight 14  is  given  by: 


M  =  K' 


(LS) 

(RI) 


(2-2) 


Where  K’  is  given  by: 


KR1  (2-3) 


Kri  and  KLs  are  instrument  calibration  constants,  LS  is  the  light  scattering  signal,  RI  is  the 
refractive  index  signal  and  dn/dc  is  the  refractive  index  increment  and  is  taken  as  0.186  mL/g 
for  globular  proteins  14 . 

A  Superdex  200  GL  10/300  column  (G  E  Healthcare)  was  connected  to  a  Viscotek  302-040 
Triple  Detector  GPC/SEC  system  (Malvern  Instruments  Ltd.),  operated  at  28  °C,  and 
equilibrated  with  appropriate  buffer  at  0.4  mL/min  for  a  minimum  of  three  column  volumes 
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until  a  stable  baseline  was  obtained.  110  pL  aliquots  of  protein  samples  were  loaded  onto  the 
column.  Absolute  molecular  weight,  radius  of  hydration  and  size  distributions  were 
calculated  using  the  refractive  index  (RI),  intrinsic  viscosity,  and  right-angle  light  scattering 
(RALS)  measurements  calibrated  against  bovine  serum  albumin  (BSA)  (66.5  kDa,  Sigma), 
which  was  run  at  the  beginning  and  end  of  each  sample  sequence.  Analysis  was  carried  out 
using  OmniSEC  software  as  per  the  instructions. 

2.6.6  Circular  dichroism  spectroscopy  (CD) 

UV  circular  dichroism  spectroscopy  was  used  to  analyse  the  secondary  structure  of  proteins. 
In  proteins,  the  optically  active  groups  are  the  amide  bonds  of  the  peptide  backbone,  and  the 
aromatic  side  chains.  When  these  peptide  chromophores  interact  with  circularly  polarised 
light,  the  secondary  structure  motifs  of  proteins  generate  a  specific  CD  spectrum,  which  is 
used  to  analyse  the  conformation  of  protein  in  solution  15;  16 .  Circular  dichroism  spectra  were 
generally  recorded  from  200  -  260  nm  in  buffer  G.  Protein  samples  were  diluted  sufficiently 
so  as  to  keep  the  high  tension  voltage  of  the  photomultiplier  detector  below  600  (usually 
1:500  dilutions).  Data  were  collected  on  a  Jasco  J-815  spectropolarimeter  in  a  2  mm  cuvette 
with  a  2  second  response  time.  For  each  experiment,  ten  spectra  were  summed  and  averaged 
to  maximise  signal-to-noise.  Spectra  were  analysed  by  Jasco  Spectra  Manager  software 
(Version  1.52). 
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2.6.7  Small  angle  X-ray  scattering  (SAXS) 

Small  angle  X-ray  scattering  (SAXS)  is  a  technique  to  carry  out  structural  characterisation  of 
biological  macromolecules  in  solution  at  relatively  low  resolution  (1-3  nm).  It  provides 
information  about  the  conformation  and  overall  structure  of  large  assemblies  (Figure  2.4A). 
Compared  to  X-ray  crystallography  and  nuclear  magnetic  resonance  (NMR),  SAXS  offers 
some  clear  advantages  17;  (i)  All  experiments  are  carried  out  in  solution,  (ii)  Large  complex 
macromolecular  assemblies  can  be  analysed,  (iii)  Sample  preparation  is  fast  and  simple,  (iv) 
The  technique  is  compatible  with  other  biophysical  methods.  With  synchrotron  sources 
readily  available,  computational  and  instrumentation  advances,  SAXS  is  fast  gaining 
importance  as  a  reliable  technique  to  study  large  organised  protein  assemblies  18.  In  this 
thesis,  SAXS  was  employed  to  extract  geometric  properties  such  as  radius  of  gyration  ( Rg ) 
and  maximal  particle  diameter  (Dmax)  to  obtain  shape  and  assembly  information  of  the 
macromolecular  assemblies. 

2. 6. 7. 1  Theory  of  small  angle  X-ray  scattering 

In  solution  scattering,  the  signal  from  all  orientations  of  the  target  molecules  relative  to  one 
another  and  the  experimental  apparatus,  are  averaged  together.  Solution  scattering  is 
continuous  and  radially  symmetric  (isotropic).  Being  essentially  a  contrast  method,  the 
scattering  signal  in  SAXS  is  derived  from  the  difference  in  the  average  electron  density, 
Ap(r),  and  bulk  solvent,  p(s) 19 ,  presented  by  the  relation: 

Ap(r)  =  p(r)  —  p(s)  (2-4) 

Data  are  collected  on  a  buffer  blank  and  on  a  sample.  Subtraction  of  observed  scattering 
yields  the  signal  from  the  scattering  due  to  the  macromolecule. 
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Figure  2.4\  ( A )  Schematic  representation  of  small  angle  X-ray  scattering.  Adapted  from 
Mertens  and  Svergun  2010.  (B)  Scattering  intensities  and  distance  distribution  functions,  P(r) 
calculated  for  typical  geometric  shapes.  Solid  shapes  (black),  prolate  ellipsoid  (red),  oblate 
ellipsoid  (blue),  two  domain  (green)  and  long  rod  (cyan).  Diagram  taken  from  Mertens  and 
Svergun,  2010. 
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SAXS  exploits  coherent  X-ray  scattering  given  by  the  Thompson  relation  20 : 

E(r,  t)  =  —  r0(sin\|//r)E(t)  (2-5) 


wherein  r0  decribes  the  electron  radius,  \| r  polarisation  angle,  r  the  distance  between  the 
observer  and  the  scattering  event. 

The  angle  between  the  wave  vectors  of  incident  and  scattered  waves  is  called  the  scattering 
angle  (20)  (Figure  2.4A)  and  is  related  to  the  intensity  of  Thompson  scattering  for  an 
incident  beam  of  intensity  lQ  by: 


7(20)  =  r  2 


'l  +  cos2  20\  I0 


(2-6) 


Scattering  from  an  assembly  of  electrons  (in  the  case  of  macromolecules)  is  given  by  the  sum 
of  the  waves  originating  from  each  electron  in  the  given  volume.  Separation  between  the 
scatterers  gives  rise  to  a  phase  difference  (j),  given  by: 


(|)  =  qA 


(2-7) 


\q\  =  471 


sin0 

~Y~ 


(2-8) 


where  |q|  is  the  momentum  transfer  vector  and  wavelength  X. 

The  pair  distribution  function  P(r)  is  also  called  the  pair  density  distribution  function.  It  is  a 
radially  averaged  autocorrelation  function  calculated  through  a  Fourier  transform  of  the 
scattering  curve  and  provides  direct  information  about  the  distances  between  electrons  in  the 
scattering  particles  in  the  sample  21 .  The  P(r)  plot  is  also  an  indicator  of  good  quality  data. 
Typically  the  P(r)  function  is  zero  at  r  =  0  and  at  r  >  Dmax.  If  such  a  condition  is  not  satisfied, 
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it  may  suggest  the  presence  of  unfolded  proteins  and  aggregate  formation.  P(r)  plots  also  give 
a  fairly  good  indication  of  the  overall  shapes  of  the  molecules  (Figure  2.4B). 

For  a  homogenous  solution,  the  scattering  intensity  over  the  entire  range,  q,  results  from  the 
spherically  averaged  electron  distribution  P(r)  of  the  macromolecule  22: 


P(r )  =  Kq)q  sin(qr)  dq 


(2-9) 


Since,  theoretically  for  a  homogenous  sample,  the  I(q )  is  measured  only  for  a  finite  number 
of  points  rather  than  [0,co],  it  is  preferable  to  compute  P(r)  by  inverse  transformation  over  the 
interval  [0 ,Dmax],  and  hence  get 20: 


Dmax 


* 

Kq )  =  P(X) 

u 


sin  (qr) 

qr 


(2-10) 


dr 


Guinier  approximation  helps  describe  the  scattering  at  low  resolution 


19 


Kq)  =  /(o)exp 


(2-11) 


Where  Rg  is  the  radius  of  gyration  of  the  particle. 

Rg  and  HO)  can  be  extracted  from  the  Guinier  plot  of  log(/(0))  against  q2,  which  gives  a 
straight  line.  Generally  for  globular  proteins,  Guinier  approximation  is  valid  over  the  range  of 
q  Rg  <  1.3  19 .  Guinier  plots  are  also  a  means  of  quality  control  of  the  sample,  as  a  non¬ 
linearity  in  the  Guinier  range  can  indicate  elongated  samples  and  requires  careful  analysis  of 
data. 
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1(0),  intensity  at  q= 0,  depends  on  the  square  of  the  number  of  electrons  (molecular  weight) 
hence  SAXS  is  particularly  useful  to  determine  the  assembly  of  proteins  17 .  Calculation  of 
molecular  weight  from  1(0)  is  less  susceptible  to  inter  particle  correlations  than  extrapolation 
of  low  q  data.  P(r)  based  1(0)  method  has  the  following  relation. 

Dmax  (2-12) 

7(0)  =  47 r  J  P(r)dr 

o 

At  low  angles,  the  scattering  intensities  of  folded  macromolecules  follow  Porod’s  law  17 

I(q)  oc  q~4  (2-13) 

However  at  higher  angles,  this  relation  does  not  hold  true  and  must  be  expanded  to  describe 
the  fractal  degrees  of  freedom  (df)  17 : 


I(q)  =  q~df  (2-14) 

SAXS  is  an  ideal  method  to  study  unfolded  domains  in  proteins.  The  Kratky  plot  which  is 
I(q )  against  q  can  reveal  the  degree  of  unfolding  depending  on  the  shape  of  the  plot.  For  a 
folded  protein,  the  graph  yields  a  peak  shaped  like  a  parabola  at  low  angles  and  a  plateau  at 
higher  angles.  Unfolded  domains  clearly  lack  this  peak  23. 
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SAXS  experiments  were  performed  at  the  SAXSAVAXS  beamline  at  the  Australian 
Synchrotron  (Melbourne,  Australia).  A  monochromatic  X-ray  beam  with  a  wavelength  of 

O 

1.03320  A  was  used.  The  range  of  momentum  transfer  q  =  An  /  sin0,  was  adjusted  to  0.010  < 
q  <  0.618  A"1  for  experiments  on  this  beamline.  A  Dectris-Pilatus  detector  was  used  to  record 
scattering  patterns  (1  m,  170  mm  x  170  mm,  effective  pixel  size,  172  x  172  pm).  All  the 
protein  samples  were  adjusted  to  2  mg/mL  and  cleared  either  by  centrifugation  (12000  rpm,  2 
min)  or  by  0.22  pm  syringe  filter.  A  Superdex  200  5/150  GL  SEC  column  was  used  in  line 
with  SAXS  collection  data.  100  pL  aliquots  of  protein  were  injected  into  the  SEC  column 
and  the  scattering  data  collected  as  each  fraction  eluted  from  the  column. 

SAXS  data  were  collected  at  room  temperature  using  an  exposure  time  of  2  s  per  image. 
Averaged  scattering  from  the  buffer  was  recorded  and  used  for  background  subtraction. 
Native  Lsma  samples  and  muteins  were  measured  in  buffer  G  and  [H6Lsma]7  was  analysed 
in  buffer  A.  Data  reduction  and  subtraction  were  performed  using  ScatterBrain  (Australian 
Synchrotron)  software  24.  The  resulting  data  were  processed  using  the  ATSAS  25  package. 
Guinier  approximations  were  carried  out  using  PRIMUS  .  Electron  distribution  functions 
were  calculated  in  GNOM  27  and  normalised  with  respect  to  P(r)  by  division  of  all  data  points 
through  their  respective  P(r)  maxima.  For  determination  of  molecular  envelopes,  GASBOR 
23  was  employed.  Comparison  of  the  solution  scattering  of  the  experimental  with  that  of  the 
theoretical  scattering  was  evaluated  using  CRYSOL  28. 


62 


Chapter  two :  Materials  and  methods 


2.6.8  Negative  stain  transmission  electron  microscopy  (TEM) 

Transmission  electron  microscopy  uses  a  high  energy  electron  beam  transmitted  through  a 
thin  sample  to  image  and  analyse  the  microstructure  of  materials.  TEM  is  now  a  standard 
technique  to  visualise  macromolecular  assemblies  29  and  supramolecular  complexes  in 
biological  systems  30.  Negative  stain  TEM  utilises  heavy  metal  salts  to  coat  the  protein 
molecules,  thereby  creating  differential  scattering  of  electrons.  This  causes  a  phase  contrast 

Q  1 

forming  an  image  of  the  material  on  the  grid  (Figure  2.5).  Negative  stain  TEM  has  been 
utilised  in  this  project  to  study  the  novel  structures  formed  by  Lsma. 

Carbon-coated  Formvar  200-mesh  copper  grids  (ProSciTech,  Australia)  were  deposited 
successively  for  one  minute  each  onto  12  pL  drops  of  (i)  protein  samples,  (ii)  water  (3  times) 
and  (iii)  a  2%  w/w  uranyl  acetate  solution  in  water.  Filter  paper  (Whatman  #1)  was  used  to 
remove  the  excess  liquid  from  the  grids,  which  were  then  left  to  dry  for  a  few  hours  before 
electron  microscopy  observations.  The  grids  were  examined  with  a  FEI  Morgagni  268D 
transmission  electron  microscope  operating  at  80  kV,  with  magnifications  up  to  180k. 
Micrographs  were  captured  using  a  SIS/Olympus  Megapixel  III  digital  camera  mounted 
above  the  phosphor  screen. 

The  EM  AN  vl.9  software  suite  32;  33  was  used  to  generate  a  two-dimensional  average 
molecular  image  of  the  generated  complexes  from  447  individual  particle  images,  selected 
from  30  micrographs.  Particles  were  interactively  selected  using  boxer;  autoboxed  particles 
were  visually  checked  and  poor  quality  images  were  manually  deleted.  To  reduce  high-and 
low-frequency  noise  components,  a  2  nm  to  20  nm  band-pass  filter  was  applied  to  the  raw 
images,  which  were  subsequently  normalised.  Parameters  of  individual  micrographs  were 
determined  and  corrected  for  using  EMAN  vl.9  methodologies.  Reference-free  class 
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averages  were  generated  in  8  rounds  of  iterative  refinement  by  using  the  refine2d.py  routine 
in  EMANvl.9. 


C 


Figure  2.5 1  Diagram  showing  components  used  in  negative  stain  transmission  electron 
microscopy.  (A)  Copper  grid  200  mesh  square.  (B)  Transverse  section  of  grid  showing 
carbon  and  formvar  layers.  (C)  Electron  scattering  through  protein  sample  stained  with 
heavy  metal  salts. 
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3  Chapter  three 

Investigation  of  the  Lsma  interfaces  using  site  directed 
mutagenesis 

3.1  Introduction 

Self-assembling  biomolecules  have  formed  a  niche  as  molecular  building  blocks  with 
potential  bionanotechnological  applications  1;  2.  In  these  applications,  the  modular  nature  and 
the  ability  to  control  the  oligomeric  state  of  ring-shaped  proteins  are  desirable.  The  change  in 
the  size  of  the  cavity  provides  additional  control  over  the  dimensions  and  morphology  of  the 
nanostructures  obtained.  The  inner  pore  lining  also  provides  a  malleable  site  to  chemically 
engineer  functional  groups  that  would  enhance  the  functionality  of  the  supramolecular 
structures.  For  this  thesis,  change  in  the  pore  size  of  Lsma  was  investigated  by  altering  the 
relevant  amino  acid  residues  involved  in  the  interactions  at  the  subunit  interface  of  each 
monomer. 

As  described  in  Chapter  one,  Lsm  proteins  are  found  in  a  number  of  oligomeric  states  in 
different  species.  The  subunit  interface  was  engineered  to  tune  the  ring  diameter.  The  first 
part  of  this  chapter  deals  with  the  preparation  and  isolation  of  recombinant  Lsm  proteins  and 
the  muteins.  The  second  part  discusses  the  solution  studies  with  comparison  to  the  wild  type 
protein. 
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3.2  Lsm  systems  investigated  for  change  in  pore  diameter 

The  modular  nature  of  a  self-assembling  system  provides  a  platform  to  control  and  modify 
the  individual  subunits  in  order  to  generate  a  biological  nanostructure  3.  The  oligomeric 
plasticity  exhibited  by  the  Lsm  proteins  (as  discussed  in  Section  1.8)  make  them  an  attractive 
target  for  manipulation  and  control  over  the  pore  size  by  altering  the  number  of  subunits 
interacting  to  form  a  ring-shaped  structure.  Two  Lsm  proteins  from  different  species  were 
looked  into  for  altering  their  oligomeric  states  to  obtain  varying  pore  size  of  the  ring. 


The  Lsm3  polypeptide  from  S.  cerevisiae  assembles  as  an  octameric  ring  when  expressed 
using  recombinant  methods  4.  Eukaryotic  Lsm  proteins  generally  adopt  a  heptameric 
quaternary  state,  it  was  therefore  of  interest  to  study  the  Lsm3  interface.  The  second  system 
chosen  was  a  thermophilic  and  highly  stable  protein,  the  Lsma  system  (from  M. 
thermoautotrophicum ).  Figure  3.1  shows  the  various  crystal  structures  of  ring-shaped 
proteins  that  have  been  studied  for  their  potential  nanotechnological  applications  5;  6  and  have 
been  modified  to  generate  useful  nanostructures  1 .  The  Lsm  scaffold  presents  one  of  the 
smallest  ring  diameters  to  work  with,  adding  a  new  dimension  to  the  nanostructures 
generated. 

A  B  C  D 


6.5  nm  8  nm  11  nm  13  nm 


Figure  3.1 1  Crystal  structures  of  the  ring-shaped  proteins.  (A)  Lsma  (PDB  ID  1181)  6.5  nm 
ring  8.  (B)  TRAP  protein  ( PDB  ID  1QAW)  8  nm  ring  9 .  (C)  Stable  protein  1  ( PDB  ID  ITRO) 
11  nm  ring  10 .  (D)  GroEL  (PDB  ID  1GRL)  ~  13  nm  ring  11 .  All  proteins  shown  approximately 
to  scale. 
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3.3  Yeast  Lsm3  system 

An  inspection  of  the  Lsm3  subunit  interface  from  X-ray  crystallographic  data  shows  that  the 
presence  of  an  extra  hydrogen  bond  and  an  absence  of  aromatic  residues  results  in  opening  of 
the  ring  and  hence,  accommodation  of  a  larger  number  of  monomers  in  the  ring  complex  12. 
From  the  structural  alignment  of  Lsm  proteins  (Figure  1.6),  it  is  evident  that  Lsm3  has  a 
longer  loop4  region  compared  to  other  Lsm  proteins.  This  has  been  implicated  in  protein 
binding  functions  12.  In  addition,  the  C-terminal  tail  of  each  Lsm3  polypeptide  engages  in  P- 
sheet  interactions  across  the  rings  4  (Figure  1.9F).  To  investigate  its  assembling  properties, 
the  expression  and  solubility  of  Lsm3  was  trialled  to  obtain  sufficient  quantities  of  the 
protein. 

3.3.1  Yeast  Lsm3  expression  system 

An  N-terminally  His6-tagged  version  of  yeast  Lsm3,  cloned  in  plasmid  pCL774,  was  obtained 
as  a  bacterial  ( E .  coli,  BL21(DE3)  pLysS)  glycerol  stock,  from  the  Protein  Structure  Group, 
Macquarie  University  (Section  2.2.3).  The  stock  was  used  to  inoculate  a  starter  culture  and 
incubated  overnight  at  37  °C,  180  rpm.  This  was  diluted  into  fresh  media  (4  mL  into  400  mL) 
and  grown  to  an  OD60o  of  0.5-0. 6  (Section  2.4.3).  The  protein  expression  was  induced  by 
addition  of  IPTG  (0.2  mM)  and  the  cells  were  harvested  by  centrifugation  after  growth  at  26 
°C  for  4  h.  The  pellet  was  resuspended  in  buffer  B  and  protein  extracted  by  high  pressure 
homogenisation  (Section  2.5.1.2).  The  lysate  was  centrifuged  at  15,000  rpm,  4  °C  for  25  min 
to  remove  the  cell  debris.  The  supernatant  containing  the  soluble  protein  was  filtered  (0.22 
pm)  and  applied  to  the  affinity  columns. 
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Adapting  protocols  established  previously  4,  a  HisTrap  column  (G  E  Healthcare)  pre-charged 
with  Ni2+  was  used  to  purify  Lsm3.  The  clarified  cell  lysate  was  directly  loaded  onto  the 
matrix  using  a  peristaltic  pump  and  the  elution  step  carried  out  by  increasing  the  imidazole 
concentration  to  500  mM  in  a  gradient  mode  (Section  2.5.2),  The  His6-tagged  Lsm3  protein 
failed  to  bind  to  the  HisTrap  matrix  since  most  of  it  is  seen  in  the  flowthrough  and  wash 
fractions,  as  assessed  by  SDS-PAGE  (Figure  3.2 A).  There  was  no  protein  that  eluted  off  the 
IMAC  column  (Figure  3.2B).  Hence,  its  purification  failed  using  this  column. 

A  Hi-Trap  Chelating  system  (G  E  Healthcare)  pre-charged  with  Ni2+  was  then  used  to  purify 
Lsm3.  The  clarified  lysate  was  loaded  onto  the  column,  pre-equilibrated  with  buffer  B 
(Section  2.5.2),  Following  washing  with  Tris-based  buffer  in  high  salt  at  pH  8.0,  bound  Lsm3 
was  eluted  by  increasing  the  imidazole  concentration  to  350  mM  (70%  of  the  gradient) 
(Figure  3.3B).  The  differential  binding  of  Lsm3  to  the  affinity  matrix  could  arise  due  to  the 
presence  of  iminodiacetic  acid  (IDA)  ligand  (Hi-Trap  Chelating  matrix)  as  opposed  to  NTA 
ligand. 


Figure  3.21  (A)  SDS-PAGE  of  Lsm3  showing  stages  of  IMAC  purification  by  HisTrap  column 


(G  E  Healthcare).  Arrow  indicates  the  absence  of  monomeric  form  in  the  elution  fraction 
corresponding  to  ~  10  kDa.  (B)  IMAC  chromatography  trace.  Absorbance  at  280  nm  is 
shown  in  black  and  concentration  of  imidazole  is  in  red. 
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(HisTrap  column  matrix).  Lsm3  was  obtained  as  a  highly  pure  sample  with  a  yield  of  40  mg 
from  400  mL  of  culture.  The  SDS-PAGE  gel  shows  the  complete  dissociation  of  the  protein 
into  its  monomer  after  boiling  for  5  min.  A  large  band  was  obtained  at  ~  10  kDa 
corresponding  to  the  monomeric  form  of  Lsm3  (Figure  3.3A).  The  purified  sample  was 
collected  and  subjected  to  analytical  size  exclusion  chromatography  (SEC). 

The  trace  obtained  by  analytical  SEC  equilibrated  with  Tris/NaCl  buffer  pH  8.0  (Figure  3.4), 
shows  that  the  majority  of  the  affinity  purified  sample,  eluted  at  the  void  volume  (fraction  I), 
indicating  aggregation.  A  small  peak  (fraction  II)  was  obtained  at  Kav  corresponding  to  ~40 
kDa.  Thus,  the  affinity  purified  Lsm3  sample  was  mostly  aggregated  in  solution.  The  sample 
also  showed  signs  of  visible  precipitation  after  a  few  hours  post-purification  when  stored  at  4 
°C. 


Since  Lsm3  was  prone  to  aggregation  during  purification  it  was  deemed  to  be  not  stable 
enough  to  act  as  a  tecton  and  no  further  work  was  done  on  this  protein. 


B 


Figure  3.31  (A)  SDS-PAGE  of  Lsm3  showing  stages  of  IMAC  purification  by  HiTrap 
Chelating  column  (G  E  Healthcare ).  Arrow  indicates  the  monomeric  form  of  Lsm3  in  the 
elution  fraction.  (B)  IMAC  chromatography  trace.  Absorbance  at  280  nm  is  shown  in  black 
and  concentration  of  imidazole  is  in  red. 
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Figure  3.4\  Analytical  SEC  ofLsm3.  The  sample  was  run  in  Tris/NaCl  buffer  pH  8.0  at  a  flow 
rate  of  0.4  mL/min  on  a  Superdex  200  GL  10/300  (G  E  Healthcare )  column. 


3.4  Design  of  Lsma  muteins 

Muteins  were  designed  to  investigate  residues  involved  in  controlling  the  pore  size  and  obtain 
a  toroid  of  a  different  oligomeric  state.  To  achieve  this,  residues  were  chosen  that  were 
considered  important  for  the  hydrogen  bonding  and  hydrophobic  interactions  between  the 
subunits  of  the  Lsma  heptameric  ring  13.  In  particular  the  residues  constituting  the  P4-  and 
P5-strands,  known  to  be  involved  in  ring  formation  were  targeted.  The  mutations  were  chosen 
to  structurally  conserve  the  Lsm  fold  per  monomer  and  simultaneously  perturb  the  hydrogen 
bonding  network  between  the  monomers.  For  each  of  the  muteins  generated,  it  was  predicted 
that  the  toroid  formation  would  be  altered  and  result  in  a  different  oligomeric  state  of  the 
protein  during  the  in  vivo  assembly  of  the  Lsma  monomers. 

Previous  work  on  Lsm  proteins  from  different  sources  have  shown  that  different  sub¬ 
complexes  can  be  generated  by  covalently  linking  different  combinations  of  Lsm 
polypeptides.  Heterodimeric  and  heterotrimeric  sub-complexes  were  reconstituted  from  the 
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components  of  human  Lsml-7  and  Lsm2-8  in  vitro  14 .  More  recent  work  shows  the  formation 
of  varying  oligomeric  forms  adopted  by  the  subunits  of  Lsm  protein  from  S.  pombe  15 .  To  our 
knowledge,  to  date,  no  work  has  been  carried  out  investigating  the  inter-subunit  interactions 
between  Lsma  monomers,  which  may  hold  answers  to  the  oligomeric  plasticity  of  Lsm 
proteins  in  general  16.  The  following  sections  detail  the  generation  of  the  site  specific 
mutations,  their  expression  and  purification  and  solution  state  characterisation  in  comparison 
to  the  wildtype  heptameric  Lsma. 

Another  strategy  employed  to  alter  the  pore  size  of  Lsma  was  the  generation  of  chimeric 
proteins.  Whole  sections  of  the  P4-  and  p5-strands  of  Lsma  were  replaced  with  appropriate 
sections  from  the  E.  coli  Hfq  protein.  It  was  hypothesised  that  the  nature  of  interactions  at  the 
subunit  interface  could  be  altered  by  the  exchange  of  segments  between  a  hexameric  toroid 
and  a  heptameric  toroid.  The  generation  and  expression  of  these  chimeric  proteins  is  also 
presented  in  this  chapter. 

3.5  Structure  of  Lsma  cyclic  oligomer  complex 

As  mentioned  in  Section  1.7,  Lsm  proteins  have  the  propensity  to  assemble  into  ring-shaped 
structures.  Lsma  assembles  as  a  cyclic  oligomer  of  seven  identical  subunits,  with  an  overall 

o 

diameter  of  6.5  nm  and  pore  diameter  of  1.5  nm  .  Within  the  heptameric  ring  the  individual 
monomers  interact  via  hydrogen  bond  pairing  of  the  P4-  and  p5-strands  coupled  with 
hydrophobic  and  hydrophilic  interactions.  There  is  formation  of  the  left  handed  propeller-like 
structure  as  each  monomer  interacts  with  its  adjoining  neighbours  via  P-sheet  extension, 
whereby  the  p4-strand  residues  are  hydrogen  bonded  to  the  P5 -strand  of  the  neighbouring 
monomer  17 . 
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In  the  subunit  A/subunit  B  interface,  the  complex  is  stabilised  by  hydrophobic  contacts  as 
well  as  hydrophilic  interactions  and  a  salt  bridge  between  Glu-35  (subunit  A)  and  Arg-64 
(subunit  B).  There  are  two  hydrophobic  pockets  enclosed  within  each  interface  that  extend  to 
all  seven  subunits,  resulting  in  the  formation  of  the  heptameric  ring.  The  first  pocket  consists 
of  Ile-27,  Val-77  and  Tyr-78  of  chain  A,  and  Leu-30,  Phe-36,  Leu-66,  Val-69  and  Ile-71  of 
chain  B.  A  second  hydrophobic  pocket  is  formed  between  side  chains  of  the  amphipathic  a- 
helix  and  hydrophobic  core  of  chain  A,  with  Val-50  and  Leu-70  of  chain  B  17 .  Enclosure  of 
these  hydrophobic  cores  of  all  component  monomers  results  in  the  formation  of  a  stable 
cyclic  oligomer. 

The  loop2  (Lys-31  and  Gly-32),  loop3  (Leu-45  and  His-46)  and  loop5  (Gly-73  and  Asp-74) 
of  all  seven  subunits  line  the  inner  hole  of  the  Lsma  ring,  while  the  outside  edge  of  the  ring  is 
formed  by  the  curved  (1-sheets  of  each  protein  chain.  This  arrangement  produces  two  distinct 
faces  for  the  complex,  with  the  N-terminal  helices  covering  one  face,  and  the  loop4 
dominating  the  opposite  surface.  These  faces  will  be  hereby  referred  to  as  the  helix  side  and 
the  loop4  side,  respectively. 
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Figure  3.51  Interface  regions  of  Lsma..  64-  and  65 -strands  ( shown  in  green)  were  selected 
to  be  mutated.  Black  lines  denote  the  hydrogen  bonds  between  the  residues.  Dotted  areas 
represent  the  regions  in  the  Lsma  fold  not  considered  for  any  change. 


76 


Chapter  three:  Investigation  of  Lsma  interfaces 


3.6  Generation  of  Lsma  interface  mutations 

The  residues  involved  in  the  subunit  interface  were  determined  through  analysis  of  the 
interface  carried  out  using  the  PDBePISA  18  online  bioinformatics  tool  which  lists  the 
interactions  by  identifying  the  residues  that  become  buried  when  two  subunits  come  together 
and  analysing  the  proximity  and  bonding  potential  of  these  residues.  It  is  a  useful  tool  for  the 
exploration  of  macromolecular  interfaces. 

Analysis  of  the  Lsma  interface  in  PDBePISA  revealed  an  intricate  network  of  hydrogen 
bonds  and  hydrophobic  interactions.  Essentially,  the  most  important  and  consequently  the 
region  probed  were  the  p4-strand  and  P5-strand  of  each  monomer,  which  consist  of  at  least 
five  hydrogen  bonds  as  listed  in  Table  3.1.  Two  residues  located  at  the  a-helix,  N10  and 
Q12,  were  also  chosen  as  they  form  hydrogen  bond  with  S42  of  the  adjacent  subunit.  The 
residues  were  chosen  from  this  list  and  mutated  to  alanine  to  eliminate  the  bond  formation. 
Since  the  nature  of  interactions  at  the  monomer  interface  is  very  complex,  single  residues 
were  chosen  to  be  mutated  thereby  altering  the  overall  pattern  of  hydrogen  bond  formation  to 
a  relatively  small  extent. 


Table  3.11  Residues  involved  in  hydrogen  bond  formation  at  p4/p5  interface*. 


Subunit  A 

Subunit  B 

1 

N10,  Q12 

S42 

2 

L70 

179 

3 

R72,  N75 

V77 

4 

E56 

Y78 

5 

T68 

S80 

*  Evaluated  in  PDBePISA. 
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Figure  3.6\  Ribbon  representation  of  the  residues  mutated  at  the  subunit  interface  of  Lsma. 

Previous  work  on  Lsma  has  attempted  to  identify  the  RNA-binding  residues  and  elucidate  the 
biological  function  of  Lsm  proteins  including  Lsma  19,  20.  Intensive  work  on  the  residues 
involved  at  the  subunit  interface  was  lacking  in  these  studies.  Hence  the  residues  spanning 
the  P4-  and  P5 -strands  were  each  mutated  to  alanine.  Alanine  screening  has  been  used  to 
identify  hotspots  in  protein  in  protein  interfaces  and  successfully  disrupt  interactions  21;  22. 
Mutations  to  alanine  are  effectively  a  side  chain  deletion,  as  the  hydrogen  bonding  potential, 
charge  and  the  steric  properties  of  the  residues  are  largely  eliminated.  Single  residue  alanine 
substitutions  were  therefore  introduced  to  the  Lsma  subunit  interface  at  positions  involved  in 
direct  hydrogen  bonding  (Figure  3.5). 

Archaeal  Lsm  proteins  exhibit  a  certain  degree  of  oligomeric  plasticity.  Lsm  proteins  from 
A.  fulgidus  belonging  to  two  different  subfamilies,  Sml  and  Sm2,  assemble  as  a  heptameric 
and  a  hexameric  complex,  respectively  23.  The  authors  explain  this  difference  to  arise  from 
the  absence  of  a  set  of  electrostatic  interactions  constituted  by  a  highly  conserved  negatively 
charged  residue  in  the  case  of  Sm2  archaeal  proteins.  Thus,  if  this  electrostatic  interaction  is 
eliminated  in  the  case  of  Lsma,  it  was  predicted  to  assemble  as  a  hexameric  complex. 
However,  due  to  the  involvement  of  this  residue  in  RNA  binding,  it  would  have  diminished 
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the  functionality  of  the  Lsma  scaffold  and  further  downstream  applications  involving  RNA 
binding  would  have  become  problematic.  In  addition  to  altering  the  inter-subunit  interface, 
residues  located  at  the  helix  faces  were  also  chosen  to  probe  possible  formation  of  ring-to- 
ring  association. 

The  mutations  made  to  the  Lsma  ring  scaffold  can  be  generalised  with  respect  to  the  location 
on  the  ring  surface  and  on  the  structural  elements  as  follows: 

(i)  Helix  face/loop  face 

N10  and  Q12  residues  are  involved  in  interactions  with  adjacent  subunits  as  described  in 

Table  3.2. 

(ii)  p 4- strand 

The  p4-strand  is  comprised  of  eleven  residues  in  the  case  of  Lsma,  out  of  which  four  were 
chosen  to  be  mutated  to  alanine.  These  residues  either  make  up  the  hydrophobic  core  or  are 
involved  in  the  hydrogen  bonding  at  the  monomeric  interface.  The  muteins  generated  were 
R65P,  V69A,  L70A  and  171  A.  R65  is  located  along  the  edge  of  the  Lsma  scaffold  and  is 
solvent  accessible.  Changing  it  to  a  proline  was  predicted  to  alter  the  bend  in  the  strand  and 
hence  influence  the  angle  between  the  monomers. 

Although  archaeal  Lsm  proteins  assemble  as  heptameric  complexes,  there  is  evidence  of  a 
hexamer  formation  in  the  case  of  Af  Sm2  (Lsm  protein  from  species  A.  fulgidus )  .  As  seen 

from  the  structural  alignment,  for  Af  Sm2,  positions  70  and  71  of  the  p4-strand  consist  of 
valine  in  place  of  leucine  and  isoleucine,  respectively  (Figure  1.7).  It  was  hypothesised  that 
by  altering  the  side  chain  to  the  methyl  group  of  alanine,  the  hydrophobic  interactions  would 
decrease  at  the  interface  causing  a  change  in  assembly  of  monomers,  thereby  altering  the 
oligomeric  state  of  the  Lsma  ring. 
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Table  3.21  Summary  of  the  mutations  made  to  the  Lsma  scaffold*. 


Region 

Mutation 

Structure/function  role 

>< 

N10A 

Asparagine  is  a  polar  interfacing  residue  involved  in  hydrogen 
bond  formation  with  Ser  at  position  42  of  the  adjacent  monomer 
thereby  contributing  to  the  formation  of  cyclic  oligomer. 

43 

i 

d 

Q12A 

Similar  to  N10,  it  is  an  interfacing  polar  residue  which  forms 
hydrogen  bond  with  S42  of  adjacent  monomer. 

R65P 

R65  faces  towards  the  solvent. 

73 

a 

C3 

& 

Y69A 

Valine  forms  part  of  the  hydrophobic  pocket  driving  the  oligomeric 
assembly  of  the  ring  scaffold. 

C/3 

4 

0Q_ 

L70A 

Leucine  takes  part  in  hydrogen  bond  formation  with  179  and  forms 
part  of  the  hydrophobic  core  of  the  interface. 

171 A 

17 1  is  an  interfacing  hydrophobic  residue. 

V77A 

Hydrophobic  residue  involved  in  hydrogen  bond  with  N75  and 

R72  of  the  corresponding  adjacent  subunit. 

73 

Y78A 

Involved  in  hydrogen  bond  formation  with  E56  and  forms  part  of 
the  hydrophobic  pocket. 

£ 

c3 

-t— > 

C/3 

1 

in 

179  A 

Hydrogen  bond  formation  with  L70  and  part  of  hydrophobic  core. 

CCL 

S80A 

Penultimate  residue  of  C -terminus  involved  in  hydrogen  bond 
formation  with  T68. 

P81T 

Terminates  the  C-terminal  tail.  Engineered  to  tyrosine  to  enable 
hydrogen  bonding  capacity. 

*  Interfacing  residues  which  are  involved  in  hydrogen  bond  formation  as  evaluated  in  PDBePISA18. 
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(Hi)  p 5-strand 

The  p5-strand  is  comprised  of  seven  residues  from  the  structural  alignment.  The  residues 
changed  to  alanine  were  V77,  Y78,  179  and  S80.  The  Lsma  sequence  is  characterised  by  an 
unusually  short  C-terminal  and  terminated  by  a  proline.  Therefore,  it  was  hypothesised  that  if 
more  hydrogen  bonds  are  included  at  the  C-terminus,  the  result  would  be  a  different 
molecular  oligomer.  Hence,  the  last  residue,  proline,  was  mutated  to  tyrosine  (P81T)  to 
mimic  the  C-terminus  of  the  octameric  Lsm3,  and  potentially  induce  the  formation  of  an 
octameric  Lsma  ring. 

3.7  Expression  and  purification  of  Lsma  muteins 

To  study  the  impact  of  the  site  specific  single  mutations  on  the  quaternary  structure  of  the 
Lsma  ring,  the  samples  were  prepared  recombinantly  as  GST-fusion  products.  Although 
using  an  affinity  peptide  is  easier  and  more  efficient  to  produce  a  recombinant  protein,  it  was 
found  that  the  His6-tag  influences  the  quaternary  organisation  of  the  Lsma  ring  scaffold.  This 
is  discussed  in  detail  in  Chapter  5.  The  GST-tag  has  been  shown  to  protect  against 
intracellular  protease  cleavage  and  otherwise  stabilise  the  recombinant  protein  24 .  The  tag  can 
be  easily  cleaved  with  a  site  specific  protease  and  the  recombinant  protein  purified  using 
glutathione  affinity  matrix  25.  This  also  provided  access  to  a  protein  that  was  definitively 
without  a  His-tag,  an  important  control  protein  for  the  work  in  Chapter  5. 

The  wildtype  Lsma  and  the  mutated  genes  were  cloned  into  pGEX-4T-2  vector  and  ordered 
from  Epoch  Life  Sciences  (Sugar  Land,  Texas,  USA).  A  conventional  strain  of  E.  coli  BL21, 
was  transformed  with  the  appropriate  vectors  and  small  scale  expression  trials  were  carried 
out  (Section  2.4.2).  Protein  levels  in  the  cells  were  assessed  by  SDS-PAGE  analysis.  The 
protein  expression  was  induced  by  IPTG  (0.2  mM)  and  the  cells  grown  for  16  h  post- 


81 


Chapter  three:  Investigation  of  Lsma  interfaces 


induction  at  26  °C  with  shaking  at  180  rpm  (Section  2.3.4).  All  the  muteins  showed  good 
level  of  expression  for  large  scale  preparation.  As  outlined  in  Section  2.5.1,  the  samples  were 
prepared  for  purification  by  the  glutathione  Sepharose  affinity  method. 

Pure  protein  samples  of  all  the  muteins  were  obtained  using  a  glutathione  Sepharose  matrix 
followed  by  thrombin  cleavage,  using  a  purification  scheme  as  described  in  Section  2.5.3. 
The  filtered  cell  lysate  was  incubated  with  the  pre-washed  glutathione  affinity  matrix 
required  for  binding  of  the  fusion  protein  to  the  matrix.  Wash  steps  were  included  to  ensure 
the  removal  of  any  loosely  bound  contaminant  proteins.  The  matrix  was  poured  into  a  gravity 
flow  column  and  incubated  with  thrombin  for  efficient  cleavage  of  the  GST-tag  from  the 
protein  of  interest.  Typical  yields  for  all  the  muteins  were  around  5-10  mg  per  400  mL  of 
culture.  Pure  protein  was  eluted  in  PBS  buffer  and  fractions  were  stored  at  4  °C.  Purity  was 
assessed  by  SDS-PAGE  (Figure  3.7)  and  further  analysed  by  preparative  SEC  and  SEC-SLS. 

Thermostable  proteins  present  an  interesting  case  when  examined  by  SDS-PAGE.  Due  to 
their  stability  towards  heat,  it  is  difficult  to  completely  dissociate  them  by  conventional 
methods  employed  for  protein  denaturation.  Lsma  and  its  muteins  had  these  issues.  As  seen 
in  Figure  3.7,  most  muteins  along  with  wildtype  Lsma  dissociate  into  the  monomer  (~  9 
kDa),  a  trimeric  state  (~  25  kDa)  in  certain  cases,  a  tetrameric  state  (~  30-40  kDa),  and  for 
some  muteins,  a  very  faint  band  was  also  observed  at  ~  55-60  kDa,  corresponding  to  the 
heptameric  non-denatured  ring.  The  incomplete  dissociation  was  likely  caused  by  the 
presence  of  the  complex  network  of  hydrogen  bonds  and  hydrophobic  interactions  at  the 
subunit  interface,  part  of  which  is  detailed  in  Table  3.2.  However,  it  could  also  indicate  a 
change  in  the  structure  caused  by  the  mutations  at  the  interface  or  a  different  oligomeric  state 


82 


Chapter  three:  Investigation  of  Lsma  interfaces 


Figure  3.7 1  SDS-PAGE  gel  of  purified  Lsma  with  muteins.  Most  muteins  show  similar  bands 
as  Lsma  as  depicted  by  arrows.  The  arrows  correspond  to  the  monomeric  (-9  kDa), 
tetrameric  (-30-40  kDa)  and  the  non-dissociated  (-60  kDa )  states. 

of  the  dissociated  species,  giving  rise  to  different  dissociation  products.  As  observed  from  the 
gel,  there  are  variations  in  the  tetrameric  species  ranging  from  30  kDa  for  R65P  and  179 A,  to 
40  kDa  for  V69A.  These  could  be  attributed  to  the  difference  in  the  oligomeric  state  of  the 
dissociated  product  of  the  muteins  caused  by  the  respective  mutations. 

It  is  interesting  to  point  out  that  among  the  mutations  studied,  L70A  and  171 A  showed  a  near 
complete  dissociation  into  the  monomeric  state,  with  complete  absence  of  the  tetrameric 
species,  when  compared  with  the  other  muteins  (Figure  3.7).  For  L70A,  these  results  were 
consistent  with  the  solution  studies  as  will  be  discussed  in  Section  3.9.1.  Both  L70  and  171 
are  involved  in  hydrogen  bonding  and  form  a  part  of  the  hydrophobic  pocket  at  the  interface 
(Table  3.2).  These  results  indicate  that  among  the  list  of  the  residues  mutated,  these  two 
residues  seem  to  contribute  to  the  stability  of  the  Lsma  ring  by  constituting  an  important 
interaction  at  the  subunit  interface. 
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3.8  Characterisation  of  Lsma 

3.8.1  Solution  state  of  Lsma 

In  order  to  characterise  the  oligomeric  state  of  Lsma  after  cleaving  with  thrombin,  the 
affinity-purified  sample  was  subjected  to  analytical  SEC  and  SEC-SLS  in  PBS  buffer  at  pH 
8.0.  These  findings  were  corroborated  with  the  previous  solution  state  analysis  of  Lsma 
prepared  by  similar  means  at  the  Protein  Structure  Group,  Macquarie  University  17 .  Lsma  had 
a  homogenous  elution  profile  at  a  Kav  0.46  consistent  with  its  single-ring  molecular  weight  of 
~  63  kDa  (Figure  3.8A).  The  monomeric  mass  of  Lsma  calculated  from  its  amino  acid 
composition  is  9.173  kDa  (including  the  extra  Gly  and  Ser  residues  at  the  N-terminal  from 
the  pGEX  plasmid).  The  analytical  SEC  results  indicate  that  the  Lsma  cyclic  oligomer  is 
composed  of  seven  subunits  and  is  consistent  with  the  previous  results  8.  Figure  3.8B  shows 
the  bands  obtained  on  SDS-PAGE  after  being  treated  with  SDS  and  being  boiled  for  5  min. 
The  monomeric  band  is  obtained  at  ~  9  kDa  and  stable  oligomers  at  ~  36  kDa  and  ~  60  kDa. 
The  presence  of  stable  oligomer  bands  indicates  that  the  Lsma  structure  can  tolerate  boiling 
in  the  presence  of  the  detergent  SDS.  The  Lsma  heptameric  oligomer  does  not  completely 
break  down  into  its  monomeric  subunits  being  a  thermostable  protein  and  is  further  stabilised 
by  the  complex  network  of  hydrogen  bonds  and  hydrophobic  interactions  at  the  protomeric 
interfaces  as  noted  in  Section  3.5.  These  stability  results  are  consistent  with  the  thermophilic 
nature  of  the  host  species  M.  thermoautotrophicum. 

SEC-SLS  analysis  of  fraction  I  revealed  that  the  purified  sample  is  homogenous  and  no 
higher  order  molecular  species  are  formed  (Figure  3.8C).  It  is  shown  to  be  monodisperse 
with  a  mass  of  63  kDa.  Thus,  Lsma  remains  as  a  single  toroidal  complex  in  solution  and  there 
is  no  association  of  the  heptameric  rings  into  higher-order  structures. 
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Figure  3.8\  (A)  SEC  chromatograph  of  Lsma  in  PBS  pH  8.0.  (B)  SDS-PAGE  gel  of  pure 
Lsma  after  boiling  for  5  min.  Arrow  indicates  the  non-dissociated  heptameric  ring.  (C)  SEC- 
SLS  of  Lsma  in  PBS  pH  8. 0  at  a  flow  rate  of  0.4  mL/min. 
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3.8.2  Transmission  electron  microscopy  of  Lsma  rings 

Transmission  electron  microscopy  (TEM)  of  Lsma  revealed  individual  rings  of  outer 
diameter  of  ~  6.5  nm  and  inner  pore  diameter  of  ~  2  nm  which  corresponds  well  with  the 
crystal  structure  dimensions.  A  histogram  plot  of  diameter  reveals  a  narrow  distribution  from 
5  nm  to  8  nm  with  the  majority  of  the  rings  at  6  nm  (Figure  3.9).  This  kind  of  distribution 
could  be  attributed  to  the  different  angular  positions  of  the  rings  on  the  TEM  grid  .  TEM 
also  confirms  the  absence  of  any  higher  order  structures  under  these  conditions. 

A 


Diameter  (nm) 


Figure  3.9 1  TEM  analysis  of  Lsma.  (A)  Negative  stain  micrograph  of  well-distributed  single 
rings  of  Lsma.  (B)  Size  distribution  histogram  collected  for  402  particles. 
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3.9  Characterisation  of  Lsma  muteins 

3.9.1  Solution  characteristics  of  muteins 

The  muteins  were  prepared  as  GST-fusion  proteins  and  analysed  by  preparative  SEC  post 
thrombin-cleavage  (as  outlined  in  Section  2.5.3).  The  conditions  of  purification  were  similar 
to  that  of  the  wildtype  Lsma  to  ensure  uniformity  and  valid  comparison  between  them. 

All  muteins  eluted  with  a  mixture  of  oligomeric  species,  with  the  majority  of  the  sample 
eluting  at  Kav  ~  0.4,  which  corresponds  to  a  single  ring  form.  Within  the  error  of  the 
preparative  SEC,  the  oligomeric  state  ranged  from  hexameric  to  heptameric.  The  SEC 
chromatographs  for  all  muteins,  with  their  apparent  molecular  weights,  are  shown  in  Figure 

3.10  (see  legend  for  molecular  weight  values)  with  comparison  with  the  wildtype  Lsma 
(Figure  3.10L).  In  Y78A  (Figure  3.10H),  a  significant  proportion  of  the  sample  was 
aggregated  compared  to  the  other  muteins  and  fraction  III  was  spread  over  a  larger  elution 
volume  with  multiple  peaks.  This  is  inidicative  of  different  oligomeric  states  that  could  not  be 
separated  by  the  S200  matrix. 

In  identical  conditions,  it  was  apparent  that  the  single  residue  changes  at  the  subunit  interface 
changed  the  species  distribution  of  Lsma.  Although  most  muteins  showed  a  heterogeneous 
distribution  on  preparative  SEC,  it  was  evident  that  certain  muteins  had  more  pronounced 
species  than  others.  L70A  (Figure  3.10F)  exhibited  a  mixture  of  species,  with  fraction  IV 
eluting  at  Kav  =  0.6,  indicating  a  monomeric  state.  It  was  difficult  to  carry  out  further  analysis 
on  this  fraction  as  it  degraded  within  a  few  hours  at  4  °C.  Interestingly,  as  discussed  in 
Section  3.7,  SDS-PAGE  of  L70A  (Figure  3.7),  showed  a  complete  dissociation  into  its 
monomer  post-boiling  and  SDS  treatment.  This  is  indicative  that  the  residue  Leu-70  takes 
part  in  crucial  interactions  that  contribute  to  the  stability  of  the  Lsma  ring  scaffold. 
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It  was  also  observed  that  the  majority  of  the  muteins  contained  fraction  II  at  Kav 
corresponding  to  ~  120  kDa.  This  is  suggestive  of  multimerisation  of  the  rings  into  putative 
dimers,  potentially  caused  by  electrostatic  interactions  or  perhaps  suggestive  of  a  different 
type  of  higher  molecular  weight  species  as  reported  in  the  case  of  P.  aerophilum  Sm3  27 .  Due 
to  the  limited  resolving  power  of  the  S200  chromatography  matrix  used,  it  was  difficult  to 
isolate  this  fraction  in  sufficient  quantity  to  carry  out  further  analysis.  For  all  muteins, 
fraction  III  was  collected  and  subjected  to  SEC-SLS  to  determine  the  molecular  weight  and 
polydispersity  of  the  proteins. 

The  preparative  SEC  provided  an  overview  of  the  various  oligomeric  states  adopted  by  Lsma 
and  its  muteins.  From  Figure  3.10,  a  pattern  appears,  reflecting  the  distribution  of  oligomeric 
states  in  solution.  Four  distinct  states  seem  to  be  common  among  the  muteins  analysed.  A 
higher  molecular  weight  (HMW)  species  eluted  at  the  void  volume,  which  was  likely  to  be  an 
aggregate  or  may  represent  an  ordered  form.  A  dimer  of  rings  ([7n2]),  was  observed  for  most 
samples,  likely  to  be  two  rings  associated.  The  majority  of  the  muteins  eluted  at  a  molecular 
weight  consistent  with  a  single  ring  form.  A  small  peak  eluted  at  Kav  corresponding  to  ~  9 
kDa  which  is  consistent  with  the  monomeric  state  of  Lsma,  which  is  pronounced  in  the  case 
of  L70A.  This  state  has  not  been  otherwise  observed  for  Lsm  proteins. 
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i  n  in  IV 
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I  II  UI  IV 
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Figure  3.10\  Preparative  SEC  profiles  of  the  single  site  directed  muteins.  All  muteins  were 
analysed  in  PBS  pH  8.0  on  a  Superdex  200  16/600  column  at  a  flow  rate  of  1  mL/min. 
Fraction  numbers  are  indicated  above.  Dashed  lines  represent  the  higher  molecular  weight 
species  (HMW),  dimer  of  heptameric  rings  ([7n]2),  heptameric  ring  (7n),  and  the  monomeric 
form  (n).  The  molecular  weight  estimated  from  preparative  SEC  is  indicated  in  bracket  A  - 
N10A  (53  kDa);  B  -  Q12A  (51  kDa);  C  -  R65P  (55  kDa);  D  -  V69A  (51  kDa);  E  -  L70A  (52 
kDa);  F  -  171 A  (55  kDa);  G  -  V77A  (50  kDa);  H  -  Y78A  (41  kDa);  I  -  179 A  (54  kDa);  J  - 
S80A  (52  kDa);  K-P81T(50  kDa);  (L)-wildtype  Lsma  (60  kDa). 


90 


Chapter  three:  Investigation  of  Lsma  interfaces 


3.9.2  Static  light  scattering  of  muteins 

SEC-SLS  was  performed  on  fraction  III  collected  post  preparative  SEC.  Figure  3.11  shows 
the  SEC-SLS  results  for  the  muteins  in  PBS  pH  8.0.  Table  3.3  lists  the  molecular  weight 
values  calculated  for  the  muteins  taken  at  the  midpoint  of  the  eluted  peak.  Fraction  III  of  the 
muteins  from  the  preparative  SEC  eluted  as  single  species,  suggesting  that  the  single  ring 
form  is  stable  and  does  not  form  higher  molecular  weight  species.  Except  for  the  muteins 
N10A,  Q12A,  L70A,  I71A  and  P81T,  most  of  the  samples  showed  a  decreasing  trend  in  the 
molecular  weight  (Figure  3.11).  It  may  suggest  a  mixture  of  species  that  could  not  be 
separated  due  to  the  limitation  of  the  S200  separating  matrix.  As  the  peaks  do  not  show 
baselines,  it  was  difficult  to  discern  the  decreasing  molecular  weight  across  the  peak.  The 
decreasing  trend  could  also  suggest  that  a  hexameric  species  is  present  with  the  heptameric 
ring  state.  The  typical  range  of  the  muteins  lay  between  61  kDa  to  64  kDa,  suggestive  of 
heptameric  states.  R65P  and  V77A  exhibited  Mw  range  of  55  kDa  to  57  kDa,  which  are 
closest  to  the  theoretical  molecular  weight  of  a  hexamer  calculated  from  its  amino  acid 
sequence. 
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(A)  N10A  (B)  Q12A 
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(G)  V77A 


(H)  Y78A 


Figure  3.11\  SEC-SLS  of  the  muteins  carried  out  in  PBS  pH  8.0  at  a  flow  rate  of  0.4  mL/min. 
The  dashed  line  indicates  the  elution  volume  of  the  heptameric  ring. 
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Table  3.31  List  of  molecular  weights  (kDa)  obtained  for  muteins  in  comparison  to  Lsma 


Sample 

Molecular  weight a  (kDa) 

Subunitsb 

Lsma 

62 

6.8 

N10A 

62 

6.9 

Q12A 

63 

7.0 

R65P 

57 

6.2 

V69A 

63 

7.0 

L70A 

63 

7.0 

171 A 

59 

6.5 

V77A 

56 

6.1 

Y78A 

61 

6.7 

179  A 

64 

7.0 

S80A 

64 

7.0 

P81T 

59 

6.5 

All  columns  were  pre-equilibrated  with  PBS  pH  8.0. 

aSuperdex  200  GL  10/300  column  was  run  at  0.4  mL/min.  The  Mw  indicated  was  taken  at  the 
midpoint  of  the  eluted  peak. 

bCalculated  by  dividing  the  molecular  weight  from  SEC-SLS  by  theoretical  monomeric  weight  of  9.1 
kDa. 


3.10  Small  angle  X-ray  scattering  by  Lsma  and  muteins 

As  mentioned  in  Section  2.6.7,  the  use  of  small  angle  X-ray  scattering  as  a  technique  to  study 
protein  in  solution  is  becoming  well  established.  Due  to  the  small  wavelength  of  X-rays  (<  1 
nm),  finer  structural  details  can  be  resolved.  SAXS  gives  a  direct  measurement  of  the  average 
scattering  by  proteins  in  solution  28  and  provides  an  accurate  molecular  weight  calculation  29 . 
Since  the  data  are  collected  in  solution,  potential  matrix  interactions  are  eliminated  which  can 
occur  during  chromatography. 
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Based  on  the  molecular  weight  assessment  by  SEC-SLS,  two  muteins,  R65P  and  V77A,  were 
selected  for  SAXS  analysis.  Wildtype  Lsma  and  N10A  were  also  subjected  to  SAXS  as 
controls  for  the  heptameric  composition.  V77A  showed  signs  of  visible  precipitation  during 
the  transport  of  the  protein  sample  and  therefore,  could  not  be  analysed.  Thus,  Lsma,  N10A 
and  R65P  were  analysed  by  SAXS. 

Samples  of  wildtype  Lsma  and  the  muteins  were  analysed  using  the  SAXS  beamline  at  the 
Australian  Synchrotron.  To  ensure  monodispersity,  all  samples  were  injected  on  a  Superdex 
200  5/50  gel  filtration  column  in  line  with  the  SAXS  beamline.  Data  were  continuously 
collected  as  the  samples  eluted  from  the  column  (Section  2.6.7),  The  protein  preparations’ 
scattering  intensities  raw  data  were  radially  averaged  to  obtain  one-dimensional  scattering 
curves  and  scattering  contributions  of  the  buffer  were  subtracted  from  the  sample  scattering 
using  the  program  ScatterBrain  30.  Figure  3.12A  compares  the  experimental  scattering 
intensities  of  the  muteins  with  wildtype  Lsma. 

The  assemblies  yielded  scattering  curves  resembling  elongated  particles.  To  allow  for  a  more 

a  1 

precise  definition  of  molecular  dimensions,  Guinier  plots  were  derived  using  PRIMUS 
(Figure  3.12B).  The  scattering  of  all  the  prepared  samples  yielded  linear  Guinier  plots.  The 
scattering  did  not  show  any  upward  curvature  at  low  angles  indicating  absence  of 
aggregation.  Overall,  however,  good  linearity  signified  data  quality  were  sufficient  for  further 
processing  and  shape  determination.  The  slopes  of  Guinier  plots  yield  Rg  values  in  reciprocal 
space  (Section  2.6.7.1). 
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A  B 


Figure  3.12\  (A)  Scattering  profiles,  (B)  Guinier  plots,  (C)  Electron  distribution  function  (P(r)),  and  (D)  Kratky  plots  of  Lsma  (black),  N10A 
(red),  and  R65P  (green).  Curves  have  been  arbitrarily  displaced  along  the  Y-axis  for  clarity  in  panel  (A)  and  (B).  Plots  are  normalised  with 
respect  to  Y-axis  in  panel  (C)  and  (D). 
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Table  3.41  Geometric  parameters  derived  from  scattering  data  of  Lsm  proteins. 


Sample 

D  a 

is  max 

(nm) 

Rg rec  (nm) a 

Rg real  (nm)  b 

Volume  a 
(nm3) 

Mw  a  (+0. 1  kDa) 

Subunits 

Lsma 

9.6 

2.7+0. 1 

2.8+0.2 

99 

62.0 

6.7 

N10A 

9.9 

2.8+0.2 

2.8+0.2 

99 

63.0 

6.9 

R65P 

9.2 

2.9+0.6 

3.0+0.3 

95 

55.0 

6.0 

a  Calculated  from  Autoporod  32 . 
b  Calculated  from  GNOM  33. 

RSKC,  reciprocal  space  radius  of  gyration  derived  from  Guinier  approximation;  R/eal,  real  space  radius 
of  gyration  from  distance  distribution  function;  Dmax,  maximum  particle  diameter. 

All  data  were  obtained  in  PBS,  pH  8.0. 

For  the  Lsma  heptameric  complex  the  Rgrec  is  calculated  as  2.7+0. 1  nm  and  molecular  weight 
of  62.0+0.1  kDa.  A  comparison  of  all  parameters  is  listed  in  Table  3.4.  Indirect  Fourier 
transformation  of  collected  data  (GNOM  33)  produced  a  smooth  bell-shaped  electron  pair 
distribution  function  ( P(r ))  for  all  complexes  (Figure  3.12C).  For  all  the  complexes  the 
results  are  consistent  with  heptameric  ring  formation  and  not  with  formation  of  any  higher 
molecular  species.  However,  Rgreal  for  R65P  was  3.0+0.3  nm  and  the  molecular  weight 
calculated  from  the  porod  volume  was  55.8+0.1  kDa,  which  corresponds  to  a  hexameric 
composition. 

In  the  case  of  R65P,  the  data  reduction  was  performed  over  four  sets  of  overlapping  frames 
using  ScatterBrain  30,  with  each  set  comprising  of  ten  to  fifteen  frames  of  the  scattering  data, 
collected  as  the  protein  was  eluting  through  the  in-line  size  exclusion  column.  This  allowed 
for  molecular  weight  calculation  covering  the  majority  of  the  eluted  fractions  and 
concentrations,  and  provided  with  an  accurate  determination  of  the  oligomeric  state  of  R65P. 
The  average  weight  obtained  was  56  kDa,  which  is  suggestive  of  a  hexameric  state  of  R65P. 
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Table  3.51  Geometric  parameters  derived  for  different  elution  fraction  of  R65P. 

Number 

of  frames 

D  a 

^ max 

(nm) 

R,rec  (nm) a 

Rg real  (nm)  b 

Volume  a 
(nm3) 

Mw  a  (±0. 1  kDa) 

Subunits 

7 

10 

2.9+0.6 

2.9+0.3 

91 

55 

6.1 

10 

9.8 

2.9+0.6 

2.9+0.3 

96 

56 

6.1 

10 

9.9 

2.9+0.6 

2.9+0.3 

96 

56 

6.1 

7 

9.6 

2.9+0.6 

2.9+0.3 

96 

56 

6.1 

a  Calculated  from  Autoporod  32 . 
b  Calculated  from  GNOM  33. 

Rgrec,  reciprocal  space  radius  of  gyration  derived  from  Guinier  approximation;  Rf'\  real  space  radius 
of  gyration  from  distance  distribution  function;  Dmax,  maximum  particle  diameter. 

All  data  were  obtained  in  PBS,  pH  8.0. 


Figure  3.13 1  Overlay  of  scattering  curves  (non-scaled)  ofR65P  taken  from  different  positions 
of  the  elution  peak,  each  representing  a  different  concentration. 

The  frames  were  chosen  and  analysed  separately  for  the  fractions.  Table  3.5  lists  the 
parameters  collected  for  different  fractions  of  R65P.  The  molecular  weight  was  calculated 
from  the  Porod  volume  using  Autoporod  32.  The  data  suggest  that  different  fractions  of  R65P 
have  similar  molecular  weight  and  all  exhibit  hexameric  composition.  Figure  3.13  shows  the 
overlaid  scattering  curves  of  different  fractions  of  R65P. 

The  real  space  radii  of  gyration  are  in  good  agreement  with  the  reciprocal  space  data  derived 
from  the  Guinier  fits  (Section  2.6.7.1).  It  is  possible  to  deduce  from  these  data  that  all 
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muteins  exist  in  a  folded  globular  state.  All  samples  yield  Kratky  plots  with  a  sharp  peak  at 
low  q  angles  (Figure  3.13D).  The  peak  at  low  angles  was  followed  by  a  plateau,  with  no 
systemic  signal  increase  indicating  a  folded  state. 

3.11  Generation  of  chimeric  Lsm  proteins 

A  second  approach  to  control  the  pore  size  was  tested  by  generating  chimeric  proteins, 
drawing  from  the  sequence  and  structural  information  from  different  species.  By  studying  the 
structural  alignment  of  Lsm  proteins,  it  was  evident  that  the  04-,  loop4  and  05-segments 
make  up  the  protomeric  interfaces  of  Lsm  proteins  16, 34 .  Out  of  these  segments,  the  04-strand 
and  loop4  regions  are  the  most  variable  (Table  3.6).  If  these  segments  are  considered 
modular  in  nature,  then  a  simple  exchange  between  the  Lsm  proteins  would  change  the 
bonding  pattern  at  the  interface  but  at  the  same  time  keep  the  monomer  fold  intact.  The 
formation  of  the  monomer  should  precede  the  oligomeric  ring  formation  leading  to  altered 
assembly  of  monomers  into  different  oligomeric  states. 

It  was  hypothesised  that  the  difference  in  the  molecular  organisation  is  a  result  of  altered 
angle  of  monomer  packing  and  the  differential  extent  of  hydrogen  bonding  interactions  along 
the  interface  (Figure  3.14).  Exchange  of  sequences  of  these  segments,  04  and  05,  between 
the  two  protein  species  was  trialled.  Available  crystal  structures  of  Hfq  (PDB  ID  1HK9)  and 
Lsma  (PDB  ID  1181)  were  studied,  in  particular  the  protomer  interfaces.  04-  and  05- 
segments  of  Hfq,  namely  VSQMV  and  ISTVVP  respectively,  were  inserted  into  the  Lsma 
scaffold,  to  change  the  interface  from  that  of  a  heptameric  oligomer  to  a  hexameric  toroid. 
Such  an  arrangement  was  predicted  to  alter  the  bonding  pattern  at  the  protomeric  interface 
and  influence  the  angle  between  the  monomeric  modules  and  hence  cause  the  assembly  of  a 
hexamer  in  place  of  a  heptamer. 
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Figure  3.14\  Schematic  showing  the  packing  of  monomers  in  a  toroid.  A  change  in  the  angle 
between  the  monomers  was  hypothesised  to  cause  a  change  in  the  oligomeric  state  of  the 
protein. 

Three  such  chimeric  muteins  were  designed  namely: 


Chimera  1  —  ft4-  and  ft 5 -strands  were  replaced  with  corresponding  Hfq  segments. 
Chimera  2  -  ft4-strand  was  replaced  with  ft4-segment  of  Hfq. 

Chimera  3  -  ft  5-strand  was  replaced  with  ft  5 -segment  of  Hfq. 


Table  3.61  Amino  acid  composition  of  Lsm  monomer  interface. 


Protein 

P4 

loop5 

p5 

Ec  Hfq 

VSQMVY 

KHA 

ISTWP 

Mt  Lsma 

EVTRRLGTVLI 

RG 

DNIVYISP 

yeast  Lsm3 

ELSESERRCEMVFI 

RG 

DTVTLIST 

The  constructs  were  designed  and  appropriate  plasmids  ordered  from  Epoch  Life  Sciences 
and  used  to  transform  E.  coli  BL21  cells  (Section  2.4.1).  The  small  scale  expression  gave 
positive  results  with  maximum  expression  at  6  h  post  protein  expression  by  IPTG-induction. 
All  three  chimeric  muteins  were  included  for  large  scale  production  in  400  mL  cultures.  The 
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Figure  3.15\  SDS-PAGE  of  chimeric  muteins.  All  the  protein  aggregated  and  was  present  in 
the  insoluble  fraction  ( indicated  by  arrow). 

SDS-PAGE  analyses,  to  check  the  solubility  of  the  constructs,  revealed  that  all  the  three 
chimeric  muteins  were  insoluble,  being  obtained  in  the  pellet  and  entirely  absent  from  the 
supernatant.  The  changes  introduced  at  the  interface  of  the  Lsma  scaffold  caused  that  protein 
to  form  aggregates  and  not  assemble  into  a  soluble  oligomer  (Figure  3.15). 

As  explained  in  Section  3.5,  the  inter-subunit  interactions  are  a  complex  network  of 
hydrogen  bonds  and  hydrophobic  interactions.  Although  Lsm  proteins  show  high  levels  of 
conservation  at  the  amino  acid  sequence  level,  the  structural  assembly  determines  its 
organisation  into  a  cyclic  oligomer.  As  shown  in  the  experiments  above,  a  simple  change  of 
the  interfacing  residues  prevents  the  ring  assembly,  indicating  that  the  associations  between 
each  monomer  is  highly  intricate  and  conserved  between  species.  These  results  suggest  that 
the  natural  assembly  of  Lsma  was  perturbed  when  the  protomeric  interfaces  were  changed.  In 
order  to  change  its  oligomeric  state,  it  is  necessary  to  refine  the  changes  with  additional  site 
specific  mutations  of  the  relevant  hydrophobic  residues  present  at  each  protomeric  interface. 
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3.12  Summary 

Two  approaches  to  change  the  oligomeric  state  and  inner  pore  diameter  of  Lsma  were  tested 
in  this  chapter.  Single  residue  mutations  were  made  in  the  subunit  interface  to  probe  the 
bonding  between  the  monomers.  In  the  second  approach,  chimeric  proteins  were  generated  by 
changing  the  P4-  and  p5-segments  of  Lsma  to  mimic  that  of  a  hexameric  protein.  Generation 
of  chimeric  proteins  did  not  yield  soluble  protein  and  hence  the  muteins  with  single  residue 
change  at  the  interface  were  tested  for  change  in  the  oligomeric  state  by  biophysical 
techniques. 

The  oligomeric  distribution  of  the  proteins  changed  with  the  introduction  of  the  alanine 
residues  as  assessed  by  preparative  SEC.  It  was  observed  that  the  prominent  fraction  of  the 
muteins  exhibited  molecular  weight  similar  to  that  of  the  wildtype  heptameric  Lsma.  A 
monomeric  state  was  exhibited  by  the  mutein  L70A  as  observed  by  SEC  and  SDS-PAGE.  It 
would  be  interesting  to  optimise  the  conditions  to  stabilise  this  monomeric  form. 

Muteins  R65P  and  V77A  exhibited  molecular  weights  that  are  consistent  with  a  hexameric 
state  as  determined  by  SEC-SLS  and  in  the  case  of  R65P,  confirmed  by  SAXS.  In  order  to 
unambiguously  distinguish  a  heptamer  and  a  hexamer,  future  detailed  work  should  include  X- 
ray  crystallography  and  native  mass  spectrometry. 

It  is  intriguing  to  look  into  the  properties  of  the  R65P  mutein  because  the  residue  is  part  of 
the  p4-strand,  with  its  side-chain  projecting  outwards.  The  change  to  proline  eliminates  the 
positive  charge  and  perhaps  causes  a  bend  in  the  strand,  thereby  inducing  a  change  in  the 
angle  at  the  interface.  This  may  lead  to  the  monomers  associating  as  a  hexamer. 
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4  Chapter  four 

Engineering  tubular  structures  of  Lsma 

4.1  Introduction 

Nanotubes  have  a  number  of  potential  uses  in  nanotechnology,  such  as  biocontainers  for  the 
controlled  release  of  drug  formulations,  or  for  applications  in  materials  and  electronics  \ 
Examples  of  naturally  occurring  protein  tubes  include  microtubules,  pili,  and  flagella. 
However,  these  multi-protein  structures  are  not  ideal  to  work  with  as  it  is  difficult  to  modify 
the  structure  and  control  their  assembly  in  vitro.  Therefore,  nanometric  protein  structures  are 
preferable,  with  a  smaller  diameter  that  can  be  easily  assembled  from  protein  units,  expressed 
in  large  quantities,  and  have  ease  of  surface  modification  to  program  the  length  of  the 
nanotube. 

This  chapter  discusses  two  approaches  to  promote  stacking  of  Lsma  rings  into  nanotubes. 

(i)  Surface  modification  based  on  electrostatic-mediated  stacking  of  the  rings. 

(ii)  Surface  modification  based  on  covalent  linking  of  the  rings  by  disulfide  bridges. 

The  Lsma  variants  designed  were  characterised  by  SEC  and  TEM.  If  Lsma  is  to  be  used  as  a 
molecular  building  block,  there  are  number  of  issues  that  need  to  be  addressed  in  order  to 
make  it  more  attractive  for  downstream  applications.  One  of  these  properties  is  being  able  to 
control  the  supramolecular  assembly  by  altering  the  ionic  or  redox  conditions  of  the  buffer  to 
effectively  provide  a  switch  to  control  the  quaternary  structure  arrangement.  These  studies 
are  discussed  in  this  chapter. 
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Figure  4.1\  (A)  Structural  alignment  of  Lsm  proteins.  The  secondary  structures  are  assigned  with  respect  to  the  Lsma  protein.  The  species 


included  in  the  alignment  are:  Mt  Lsma-M.  thermoautotrophicum;  AfSml  and  Af  Sm2 -A.  julgidus;  Sc  Lsm3-S.  cerevisiae;  Sp  Lsm3-S.  pombe; 
Sa  Hfq-  S.  aureus;  Ec  Hfq-E.  coli.  (B)  Ribbon  representation  of  Lsma  fold  showing  helix  and  /3-strand  elements.  L4  is  the  loop  between  [34- 
and  /35-strands.  (C)  Side  view  of  Lsma  with  a  height  of  3  nm. 
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4.2  Polymerisation  of  Lsm  proteins 

Lsm  proteins  have  the  ability  to  assemble  into  fibres  under  certain  conditions  (Section  1.9). 
In  the  case  of  Lsm  proteins  of  archaeal  origin,  also  termed  Sm-like  archaeal  proteins 
(SmAPs),  from  M.  thermoautotrophicum  and  P.  aerophilum,  both  form  bundles  of  more  than 
200  nm  in  length  and  about  100  nm  in  width  2.  Pae  SmAPl  (PDB  ID  1LNX)  has  the 
tendency  to  form  a  disulfide-bonded  dimer  of  rings  via  the  loop-to-loop  face.  But  under  low 
ionic  strength  buffering  and  reducing  conditions,  the  protein  formed  polymerised  fibres  2 
(Figure  1.9E).  In  the  case  of  Mt  SmAPl  (PDB  ID  1JBM),  the  protein  also  assembles  into 
striated  fibres  as  studied  by  TEM  and  by  examining  the  crystal  structure  packing  (Figure 
1.9D).  The  bacterial  homologue  of  Lsm  protein,  Hfq,  also  exhibits  the  tendency  to  form  large 
macromolecular  structures  3.  It  is  interesting  to  point  out  the  difference  between  the 
assemblies  of  the  fibres  in  the  cases  mentioned  above.  For  Mt  SmAPl,  it  is  head-to-tail 
stacking  of  the  homoheptamers,  whereas  the  Hfq  fibres  are  formed  in  layered  organisation, 
each  layer  formed  by  a  hexamer  of  the  hexameric  rings  3.  These  structures  provide  insight 
into  the  packing  of  Lsm  rings,  and  inform  the  design  of  a  controlled  bottom-up  fabrication  of 
the  rings  in  discrete  steps. 

4.3  Design  rationale  to  engineer  protein  nanotubes 

As  mentioned  in  Section  3.1,  Lsma  exists  as  a  homomeric  ring  containing  seven  repeating 
subunits,  associated  through  a  network  of  hydrophobic  interactions  and  hydrogen  bonds 
between  P-strands  lining  adjacent  protomers.  Due  to  the  radial  symmetry  of  the  associated 
monomers,  a  toroidal  face  is  created  containing  seven  N-terminal  helical  elements,  termed  the 
helix  face,  and  the  obverse  surface  contains  the  loop4,  of  each  monomer,  and  is  termed  as 
loop  face.  Thus,  the  ring  scaffold  provides  four  surfaces  that 
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A 


B 


Figure  4.2\  (A)  Inter-ring  interface  of  Lsma  showing  N10  (black),  R13  (green),  and  E61 
(blue)  as  stick  representation.  (B)  Surface  representation  of  stacks  formed  in  the  crystal 
lattice  of  Lsma  (PDB  ID  1181 )  showing  the  location  of  the  residues  mutated. 

can  be  modified  for  manipulation  of  the  quaternary  organisation  of  the  rings.  The  helix  and 
loop  faces  are  easily  accessible  and  provide  a  large  surface  area  for  ring-to-ring  binding. 

From  the  structural  alignment  of  the  Lsm  sequences  (Figure  4.1A),  it  is  known  that  the  N- 
terminal  and  loop4  regions  are  the  most  variable  segments.  The  oligomerisation  of  the  ring 
itself  is  largely  dependent  on  the  interactions  between  the  P4-  and  P5-strands  as  discussed  in 
Chapter  three.  As  highly  exposed  structural  segments,  the  N-terminal  and  loop4  regions  were 
chosen  as  mutation  sites  and  thereby  targeting  the  opposite  faces  of  the  Lsma  tecton. 
Selecting  the  variable  structural  segments  for  the  mutations  was  not  predicted  to  cause  any 
changes  to  the  heptameric  toroidal  scaffold.  Even  though  the  N-terminal  residues  are  part  of 
the  hydrophobic  pockets  stabilising  the  subunit  interface,  care  was  taken  to  select  residues 
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that  are  solvent  exposed  and  facing  outward  to  allow  probable  interaction  with  another  Lsma 
ring  module. 

A  study  of  the  interface  between  paired  ring  modules  from  the  crystal  packing  in  X-ray 
crystallographic  structures  (PDB  1181)  suggested  that  the  position  of  side  chains  of  Asn-10 
and  Arg-13,  located  at  N-terminal  helix  and  Glu-61  located  within  loop4  were  well-suited  for 
Ala  and  Cys  substitutions  (Figure  4.2A).  Hence,  post  assembly  of  the  ring,  there  would  be 
seven  substitutions  at  the  helix  face  and  the  loop  face  of  the  scaffold,  thereby  conferring  the 
ring  with  seven  potential  attachment  points  for  conjugation.  In  order  to  have  additional 
control  on  the  stacking  of  these  rings,  each  site  was  separately  modified  with  double  and 
single  substitutions. 

(i)  Electrostatically-mediated  ring  stacking 

Electrically  charged  and  acidic  residues  on  the  N-terminal  helix  and  loop  face  were  chosen 
which  were  exposed  towards  the  outside  solvent.  Single  (R13A)  and  double  (N10A/E61A) 
muteins  were  generated  by  changing  specific  residues  to  Ala.  R13  and  N10  are  part  of  the 
helix  face,  whereas,  E61  is  located  on  the  loop  face.  The  archaeal  Lsma  structure  shows  a 
distinct  polarity  in  its  electrostatic  potential,  with  loop4  side  being  more  acidic  than  the  helix 
face  (Figure  4.3).  It  was  hypothesised  that  altering  the  surface  charge  would  cause 
multimerisation  of  the  rings,  as  previously  observed  in  the  case  of  Lsm  polyproteins  4, 
thereby  allowing  control  over  the  stacking  of  rings. 


Ill 


Chapter  four:  Engineering  tubular  structures  of  Lsma 


Figure  4.3\  Surface  electrostatics  of  the  Lsma  ring  complex  showing  the  helix  face  and  loop 
faces.  Inset  shows  the  side  view  of  the  ring.  Structures  were  generated  in  Pymol  5  and 
electrostatic  surfaces  were  computed  with  Adaptive  Poisson-Boltzmann  Solver  (APBS) 
algorithm  6  and  contoured  so  that  red  and  blue  represent  -5kT/e  and  5kT/e  respectively. 


(ii)  Disulfide-mediated  ring  stacking 

Doughnut-shaped  proteins  have  been  engineered  to  polymerise  into  tubes  by  introducing  Cys 
residues  on  the  surface  of  the  ring  scaffold.  There  are  two  literature  precedents  for  this,  each 
using  rings  of  larger  dimensions  than  the  Lsma  tecton  presented  here.  The  first  case  was 
TRAP  protein  from  B.  stearothermophilus,  which  polymerised  in  the  presence  of  the 
reducing  agent,  DTT,  acting  as  a  cross-linker  bridging  the  rings  together  1 .  The  second 
example  is  the  HCP1  protein,  a  secretary  protein  from  P.  aeruginosa,  which  was  shown  to 
assemble  as  stacks  by  disulfide  conjugation.  Furthermore,  the  authors  were  able  to  plug  the 
ends  of  the  tubes,  thereby  creating  a  hollow  protein  capsule  8.  This  work  served  as  an 
inspiration  for  creating  Lsma  tubes.  A  similar  strategy  was  employed  for  polymerisation  of 
Lsma  rings. 
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A  scheme  of  the  possible  combinations  of  stacks  that  can  be  formed  is  depicted  in  Figure  4.4. 
Since  the  Lsma  ring  exhibits  two  chemically  distinct  helix  and  loop  faces,  three  possible 
stacking  events  are  hypothesised;  helix-to-loop,  loop-to-loop,  and  helix-to-helix.  This 
combination  can  either  be  coaxial  or  skewed  in  nature.  The  polymerisation  of  these  basic 
stacked  units  can  give  rise  to  an  extended  tubular  structure  of  varying  length. 


helix  face 


loop  face 


conjugation 
- > 


A 


ionic/redox 

conditions 


coaxial  stacking 


helix  to  loop  loop  to  loop  helix  to  helix 


skewed  stacking 


regular  mixed 

I - 1 

varying  length  of  nanostacks 

Figure  4.4\  Schematic  showing  the  different  possible  combinations  of  the  modified  Lsma 
rings  for  polymerisation  into  tubes. 
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4.4  Preparation  and  isolation  of  recombinant  protein  complexes 

The  muteins  were  generated  as  GST-fusion  products.  It  was  shown  in  Chapter  three,  that  this 
strategy  produces  sufficient  yield  of  the  wildtype  and  the  muteins  with  easy  on-column 
cleaving  of  the  GST-tag.  Although  the  GST-tag  was  attached  to  each  of  the  monomers,  it  did 
not  interfere  with  the  assembly  of  the  oligomeric  ring.  Therefore,  as  a  successful  purification 
methodology,  the  designed  Lsma  muteins  were  generated  as  GST-fusion  products. 

Five  GST-fusion  constructs  namely  R13A,  N10A/E61A,  NIOC,  E61C  and  N10C/E61C  were 
designed  and  the  genes  cloned  in  pGEX  plasmids  coding  for  an  N-terminal  GST-tag  (Section 
2.5.3).  These  constructs  were  ordered  from  Epoch  Life  Sciences  (Section  2.2.3)  and  used  to 
transform  a  conventional  strain  of  E.  coli  cells.  The  protein  products  were  expressed  in  E.  coli 
BL21  cells  following  induction  by  IPTG  (0.2  mM).  The  yield  was  monitored  by  SDS-PAGE 
analysis  and  was  found  to  be  optimal  after  growth  at  26  °C  for  8  h  post-induction.  Typically 
5-10  mg  of  protein  was  produced  per  400  mL  of  growth  media.  All  three  constructs  showed 
good  levels  of  expression  appropriate  for  further  large  scale  preparation. 

All  the  muteins  were  successfully  recovered  using  the  glutathione  Sepharose  affinity  matrix 
(Section  2.5.3).  Figure  4.5  shows  the  SDS-PAGE  gel  for  the  various  steps  of  purification  for 
E61C.  Similar  yields  were  obtained  for  R13A,  N10A/E61A,  NIOC  and  N10C/E61C.  The 
fusion  proteins  were  cleaved  with  thrombin  whilst  still  attached  to  the  column  and  then 
subjected  to  preparative  SEC  to  ascertain  their  size  distribution  and  to  separate  them  from 
contaminating  thrombin  and  small  amounts  of  GST. 
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Figure  4.51  SDS-PAGE  gel  showing  the  affinity  purification  steps  of  recombinant  E61C  from 
E.  coli.  The  cell  lysate  shows  the  complete  protein  content  in  the  cell  after  induction  of 
protein  expression  by  1PTG.  The  wash  is  glutathione  affinity  matrix  wash.  The  eluted  fraction 
represents  the  protein  released  following  thrombin  cleavage  of  bound  fusion  protein.  Bands 
are  obtained  at  ~9  kDa  (monomer),  ~25  kDa  and  40  kDa  (non-dissociated  oligomers ). 

4.5  Characterisation  of  R13A  and  N10A/E61 A 

R13A  and  N10A/E61A  were  subjected  to  analytical  SEC  in  different  salt  concentrations. 
Figure  4.6  shows  the  SEC  traces  obtained  for  both  the  muteins.  A  phosphate  buffer  was  used 
at  pH  8.0,  at  three  salt  concentrations  i.e.  20  mM,  200  mM  and  800  mM.  The  samples  were 
collected  after  preparative  SEC  and  dialysed  in  the  appropriate  buffer  for  12  h  at  4  °C 
(Section  2.6.2).  For  all  the  conditions  tested,  the  elution  volume  of  the  samples  remained  at 
the  Kav  corresponding  to  a  single  ring  (~  63  kDa).  Therefore,  the  stacking  of  Lsma  muteins 
was  not  induced.  As  a  control,  the  wildtype  Lsma  was  also  dialysed  into  the  above  mentioned 
buffers.  It  did  not  exhibit  any  change  in  the  supramolecular  assembly  with  change  in  the  salt 
concentration.  R13A  and  N10A/E61A  were  also  dialysed  against  buffers  with  pH  7,  8,  and  9. 
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R13A  N10A/E61A 


Figure  4.6\  Analytical  SEC  ofR13A  and  N10AJE61A  in  different  salt  concentrations.  (A),  (B), 
and  (C)  show  the  mutein  R13A  in  20  mM,  200  mM,  and  800  mMNaCl,  respectively.  (D),  (E), 
and  (F)  show  the  mutein  N10A/E61A  in  20  mM,  200  mM,  and  800  mM  NaCl,  respectively. 
The  SEC  column  was  equilibrated  with  PBS,  pH  8.0  and  run  at  a  flow  rate  of  0.4  mL/min. 
The  dashed  line  represents  the  single  ring  [7n]  elution  position. 
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The  samples  were  analysed  by  analytical  SEC  and  did  not  show  any  change  in  the  molecular 
weight  (data  not  shown). 

As  mentioned  in  Section  4.2,  fibril  structures  of  Lsma  were  obtained  when  dialysed  into  a 
low  ionic  strength  buffer  2.  Similar  conditions  were  tested  on  Lsma  by  dialysing  the  protein 
against  10  mM  Tris,  pH  7.5,  60  mM  NaCl  for  12  h  at  room  temperature  and  tested  by 
analytical  SEC  and  TEM.  In  the  low  ionic  strength  buffer,  the  Lsma  behaved  as  a  single  ring 
and  did  not  undergo  any  change  into  higher  order  structures  (Figure  4.7 A,  B). 


Figure  4.71  (A)  Analytical  SEC  chromatograph  of  Lsma.  Dashed  line  represents  elution 
volume  of  single  ring.  (B)  TEM  of  Lsma  dialysed  in  10  mM  Tris,  pH  7.5,  60  mMNaCl. 
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4.6  Disulfide-mediated  ring  stacking 

In  order  to  induce  the  formation  of  supramolecular  assemblies  in  which  intact  rings  are 
covalently  associated,  faces  of  the  protein  ring  were  modified  with  Cys  residues  to  allow 
disulfide  conjugation.  The  native  Lsma  protein  sequence  does  not  contain  any  Cys  residues, 
ensuring  there  will  be  no  interference  with  these  newly  engineered  thiol  groups.  The  three 
Cys  muteins,  namely  NIOC,  E61C,  and  N10C/E61C  were  characterised  by  SEC  and  TEM. 

4.6.1  Characterisation  of  E61 C 

E61C  existed  as  a  mixture  of  species  as  judged  by  preparative  SEC  under  non -reducing 
conditions  in  a  phosphate  buffer  at  pH  8.0.  The  trace  obtained  (Figure  4.8A)  shows  a 
distribution  of  three  species.  Fraction  I  eluted  in  the  void  volume,  and  is  most  likely 
indicative  of  aggregated  product  formed  during  the  affinity  purification  step. 

Fraction  II  eluted  at  Kav  0.3  (-110  kDa)  corresponding  to  2  rings  that  are  perhaps  associated 
via  an  axial  or  random  stacking  mechanism.  However,  the  SDS-PAGE  gel  of  the  sample 
showed  little  protein  content  for  fraction  II.  The  majority  of  the  sample  eluted  at  Kav  0.4 
corresponding  to  ~  60  kDa  which  is  the  single  ring  form.  Fraction  III  appeared  as  three  bands 
at  9  kDa  (monomeric  form),  and  40  kDa  and  60  kDa  (thermostable  oligomers)  (Figure  4.8B). 
Since  there  are  no  bands  observed  at  higher  molecular  weights,  it  can  be  concluded  that  E61C 
(fraction  III)  exists  predominantly  as  a  toroidal  ring  with  no  sulfhydryl  associations  in  non¬ 
reducing  conditions. 
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Figure  4.8\  SEC  chromatograph  of  E61C  in  non-reducing  conditions.  (A)  Preparative  grade 
SEC  ( Superdex  200  16/600 )  pre-equilibrated  with  10  mM  PBS  pH  8.0  and  run  at  a  flow  rate 
of  1  mL/min.  (B)  SDS-PAGE  of  fractions  obtained  on  preparative  SEC.  (C)  Analytical  SEC 
run  with  same  buffer  conditions  at  a  flow  rate  0.4  mL/min.  The  dashed  line  indicates  the 
fraction  with  Kav  corresponding  to  a  single  ring.  (D)  SDS-PAGE  of  E61C  in  non-reducing 
and  reducing  conditions. 
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To  check  if  E61C  interconverts  into  other  species,  fraction  III  was  collected  and  injected  in 
an  analytical  SEC  under  reducing  and  non-reducing  conditions. 

The  samples  were  dialysed  into  PBS,  pH  8.0  with  and  without  reducing  agent  for  2  h  at  room 
temperature  prior  to  loading  onto  the  SEC  column.  Figure  4.8C  and  Figure  4.9  shows  the 
SEC  traces  obtained  for  E61C.  Under  both  conditions  the  sample  eluted  as  a  single  species 
with  a  Kav  0.4  (~  63  kDa),  confirming  that  the  species  is  present  in  the  single  ring  state.  The 
SDS-PAGE  showed  the  characteristic  three  bands.  From  the  size  distribution  of  the  sample,  it 
is  evident  that  modification  of  the  loop  face  did  not  result  in  disulfide  conjugation.  Therefore, 
in  the  case  of  E61C,  no  higher  order  assembly  was  observed. 


Figure  4.9\  Analytical  SEC  of  E61C  in  reducing  conditions  (10  mM  TCEP)  at  a  flow  rate  of 
0.4  mL/min.  The  dashed  line  indicates  the  position  of  a  single  ring. 
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4.6.2  Characterisation  of  NIOC 

The  preparative  SEC  trace  in  non-reducing  conditions  obtained  for  NIOC  exhibited  a  mixture 
of  species  similar  to  E61C  (Figure  4.10A).  Fraction  I  eluted  in  the  void  volume,  whereas 
Fraction  II  shifted  to  Kav  0.14,  indicating  stacking  of  ~4  rings  which  is  different  from  E61C. 
The  majority  of  NIOC,  eluted  at  Kav  0.30,  with  the  molecular  weight  calculated 
corresponding  to  two  rings.  This  is  suggestive  of  the  two  rings  linked  via  disulfide  bridges. 
Fraction  III  was  collected  and  subjected  to  analytical  SEC  both  in  reducing  and  non-reducing 
conditions.  The  samples  were  dialysed  into  PBS,  pH  8.0  with  and  without  reducing  agent  for 
2  h  at  room  temperature  prior  to  loading  onto  the  SEC  column. 

(i)  Non-reducing  conditions 

The  elution  profile  from  Figure  4.10C  shows  that  NIOC  eluted  as  a  single  species  but  with 
an  altered  Kav  of  0.30,  indicating  a  molecular  weight  of  ~  110  kDa,  indicative  of  two  rings 
associated  together  in  non-reducing  conditions. 

(ii)  Reducing  conditions 

With  addition  of  a  reducing  agent  (10  mM  TCEP),  the  elution  peak  shifted  to  Kav  0.4, 
corresponding  to  ~  60  kDa  (Figure  4.11),  and  hence  a  single  ring  state.  Thus,  under  reducing 
conditions,  the  dimers  are  dissociated. 

SDS-PAGE  in  reducing  and  non-reducing  conditions  reveals  results  consistent  with  SEC. 
Figure  4.10C  shows  NIOC  in  higher  molecular  weight  states  in  the  non-reduced  condition. 
The  lowest  band  at  ~  18  kDa  corresponds  to  two  monomers  associated  via  disulfide  bonds. 
There  is  formation  of  a  ~  50  kDa  species,  followed  by  higher  order  forms  above  60  kDa,  with 
a  complete  absence  of  the  tetrameric  species.  Whereas,  in  the  reduced  state,  the  bands  are 
typical  of  Lsma  protein,  with  species  at  ~  9  kDa,  ~  40  kDa  and  ~  50  kDa. 
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Figure  4.10\  SEC  chromatograph  of  NIOC.  (A)  Preparative  grade  SEC  (Superdex  200 
16/600)  pre-equilibrated  with  10  mM  PBS  pH  8.0  and  run  at  a  flow  rate  of  1  mE/min.  (B) 
SDS-PAGE  of  fractions  from  preparative  SEC.  (C)  Analytical  SEC  run  with  the  same  buffer 
conditions  at  a  flow  rate  0.4  mL/min.  The  dashed  line  indicates  the  fraction  with  Kav 
corresponding  to  a  single  ring.  (D)  SDS-PAGE  of  NIOC  in  non-reducing  and  reducing 
conditions. 
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Hence,  the  data  are  consistent  with  ring  formation  being  followed  by  disulfide  linking  of  two 
rings,  resulting  in  helix -to-helix  stacking.  This  could  be  attributed  to  the  fact  that  the  N- 
terminal  structural  elements,  including  the  N10  residues,  are  more  flexible  and  exposed  than 
the  loop  elements,  thereby  available  for  thiol-thiol  associations,  in  contrast  to  the  data 
presented  for  E61C.  This  results  in  a  locked  dimer  of  rings. 

Hence,  the  helix  face  is  more  conducive  towards  linking  but  results  in  a  dimer  of  rings,  not 
capable  of  further  polymerisation  and  extension  to  nanotubes.  The  loop  face  does  not  link 
together  due  to  structural  constraints,  and  therefore  has  a  functionalised  face  available  for 
further  polymerisation  reactions  with  the  double  Cys  mutein,  N10C/E61C.  The  surface  onto 
which  the  Cys  residues  are  engineered  affects  the  conjugative  properties  of  the  toroidal 
scaffold. 


Figure  4.11\  Analytical  SEC  of  NIOC  in  reducing  conditions  (10  mM  TCEP)  at  a  flow  rate  of 
0.4  mL/min.  The  dashed  line  indicates  the  elution  volume  of  a  single  ring. 
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4.6.3  Characterisation  of  N10C/E61C 

The  double  mutein  N10C/E61C  was  analysed  by  preparative  SEC.  Since  there  are  engineered 
Cys  residues  on  the  two  faces  of  the  ring  scaffold,  the  solution  state  of  the  sample  was 
assessed  in  reducing  and  non-reducing  conditions.  The  samples  were  dialysed  into  PBS,  pH 
8.0  with  and  without  reducing  agent  for  2  h  at  room  temperature  prior  to  loading  onto  the 
SEC  column. 

(i)  Non-reducing  conditions 

Preparative  SEC  revealed  a  mixture  of  species  in  phosphate-based  buffer.  As  shown  in 
Figure  4.12A,  the  majority  of  the  sample  eluted  in  the  void  volume,  indicating  a  species  > 
1300  kDa  (fraction  I).  Fraction  II  elutes  at  Kav  0.20  corresponding  to  ~  340  kDa,  indicating  a 
probable  stacking  of  ~5  rings.  Fraction  III  corresponds  to  a  Kav  0.5.  Since  the  formation  of 
disulfide  bridges  was  hypothesised  between  the  rings,  leading  to  the  generation  of  larger 
molecular  weight  species,  fraction  I  was  collected  and  analysed  further  by  analytical  SEC  and 
TEM.  Since,  the  higher-order  structures  were  formed  during  the  purification  of  the  sample, 
the  disulfide  association  of  N10C/E61C  rings  is  a  spontaneous  event  most  likely  being 
oxidised  when  exposed  to  air.  Furthermore,  to  confirm  the  nature  of  conjugation,  N10C/E61C 
was  also  subjected  to  reducing  conditions  in  preparative  SEC. 

(ii)  Reducing  conditions 

In  reducing  conditions,  N10C/E61C  exhibited  a  drastic  change  in  the  oligomeric  distribution 
as  assessed  by  SEC  (Figure  4.12B).  Contrary  to  the  mixture  of  species  obtained  in  a  non¬ 
reducing  environment,  in  the  presence  of  10  mM  TCEP,  the  equilibrium  shifted  towards  a 
predominantly  single-ring  species.  Fraction  I  was  obtained  in  the  void  volume,  which  could 
indicate  the  non-reduced  higher  molecular  species  being  >1300  kDa.  Fraction  II  had  a  Kav 
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Figure  4.12\  Preparative  grade  SEC  of  N10C/E61C.  Superdex  200  16/600  column  pre¬ 
equilibrated  with  10  mM  PBS  pH  8.0  and  run  at  a  flow  rate  of  1  mL/min.  (A)  Non-reducing 
conditions  and  (B)  Reducing  conditions  (10  mM  TCEP). 
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0.2,  indicating  stacks  of  2-3  rings.  The  majority  of  the  sample  was  obtained  as  Fraction  III 
with  Kav  corresponding  to  the  single  ring  form.  This  is  consistent  with  the  nature  of  the 
linkage  between  the  rings  occuring  via  disulfide  bridges.  Fraction  III  was  collected  and 
analysed  by  analytical  SEC. 

N10C/E61C  eluted  as  a  single  species  on  analytical  SEC.  Inspection  of  the  chromatograph 
showed  the  presence  of  shoulders  which  may  indicate  stacking  events  even  in  the  presence  of 
reducing  agent.  A  large  portion  of  the  sample  was  completely  reduced  to  single  rings  in  the 
presence  of  molar  excess  of  reducing  agent  (Figure  4.13A).  Consistent  with  the  solution 
studies,  the  SDS-PAGE  analysis  showed  higher-order  oligomeric  forms  in  non-reduced 
conditions  (Figure  4.13B). 

A 


CM 

< 


Figure  4.13\  (A)  Analytical  SEC  of  N10C/E61C  in  10  mM  PBS  pH  8.0,  10  mM  TCEP  at  a 
flow  rate  0.4  mL/min.  The  dashed  line  indicates  the  position  of  a  single  Lsma  ring  on  the 
analytical  column.  (B)  SDS-PAGE  showing  reduced  and  non-reduced  sample  ofN10C/E61C. 
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4.7  TEM  of  protein  stacks 

N10C/E61C  spontaneously  assembled  into  nanostacks  as  observed  by  negative  stain  TEM. 
As  discussed  in  Section  4.3,  the  ring-to-ring  stacking  can  be  axial  i.e.  they  share  a  common 
axis,  or  skewed  i.e.  the  tube  axis  is  staggered  and  misaligned.  Figure  4.14A  shows  the 
organisation  of  rings  into  tubes  of  length  ranging  from  10  nm  to  50  nm,  suggesting  about 
three  to  sixteen  rings  (the  height  of  a  single  ring  being  3  nm)  associated  in  axial  stacking. 
From  the  size  distribution  histogram,  the  majority  of  the  tubes  were  ~20  nm  in  length  (Figure 
4.14B).  Although  most  of  them  were  linear  in  geometry,  curved  tubes  were  also  observed, 
indicating  the  presence  of  skewed  stacking.  The  same  sample,  in  the  presence  of  reducing 
agent,  disassembles  into  single  rings  as  shown  in  Figure  4.14C.  Thus,  the  gross  organisation 
of  Lsma  can  be  controlled  by  changing  the  redox  environment. 

4.8  Polymerisation  of  nanotubes 

As  discussed  in  Section  4.3,  other  ring-shaped  proteins  have  been  engineered  to  assemble  as 
stacks.  In  order  to  control  the  length  of  N10C/E61C,  the  two  ends  of  the  tube  were  blocked 
using  the  single  Cys  version  of  Lsma.  As  observed  from  the  solution  behaviour  of  the  two 
single  Cys  constructs,  NIOC  is  most  likely  constrained  into  a  head-to-head  dimer,  thereby 
removing  the  ability  to  form  thiol-thiol  associations  with  other  modules.  NIOC  and  E61C 
have  one  functionalised  face  capable  of  forming  disulfide  linkages  through  the  exposed 
sulfhydryl  (-SH)  groups,  with  E61C  showing  the  cleanest  behaviour  in  solution  (Section 
4.6.1). 


127 


Chapter  four:  Engineering  tubular  structures  of  Lsma 


A 


B 


Length  of  tubules  (nm) 


Figure  4.14\  Negative  stain  transmission  microscopy  of  N10C/E61C.  (A)  Micrograph 
showing  formation  of  tubes  prepared  in  a  phosphate  buffer.  (B)  Histogram  showing  the 
distribution  of  length  of  tubes.  (C)TEM  of  N10C/E61C  in  presence  of  the  reducing  agent. 
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The  samples  were  prepared  by  dialysing  into  PBS  pH  8.0  containing  10  mM  TCEP  for  6  h  at 
room  temperature,  to  ensure  that  the  thiol  groups  are  reduced.  Different  stoichiometries  of 
N10C/E61C  and  E61C  were  mixed  together  and  dialysed  into  PBS,  pH  8.0  buffer  to  remove 
the  reducing  agent  alongside  providing  sufficient  interaction  time.  The  samples  were 
analysed  by  non-reducing  SDS-PAGE  to  assess  the  formation  of  higher  order  structures. 

As  seen  from  Figure  4.15A,  higher  molecular  weight  bands  are  obtained  on  the  SDS-PAGE 
when  the  two  constructs  are  mixed  together.  It  was  observed  that  there  is  a  shift  in  the 
molecular  weights  by  an  increment  of  ~  9  kDa.  This  suggests  that  there  are  disulfide  linkages 
between  the  monomers  of  N10C/E61C  and  E61C  which  decreases  as  the  ratio  decreases.  The 
size  distribution  of  the  tubules  also  changes  and  the  length  distribution  is  tighter  when 
compared  with  spontaneous  tube  formation  (Figure  4.16C).  Therefore,  a  higher 
concentration  of  N10C/E61C  leads  to  longer  polymerised  chains  and  hence  the  appearance  of 
more  number  of  discrete  bands  on  the  gel. 
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regular  mixed 


Figure  4.15\  (A)  Non-reducing  SDS-PAGE  showing  formation  of  higher  order  structures  with 
capping  of  N10C/E6 1C  with  E61C  module.  1 -marker;  2-N10C/E61C  (reduced);  3-E61C 
(reduced).  Lanes  4  to  8  show  ratios  of  N10C/E61C  :  E61C.  4  -3:1;  5  -  2:1;  6  - 1:1;  7  -  1:2;  8 
-  1:3.  (B)  Schematic  showing  the  possible  combinations  of  stacking  obtained  by  mixing 
N10C/E61C  with  E61C. 
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Figure  4.16\  TEM  micrographs  showing  N10C/E61C  and  E61C  in  (A)  3:1  and  (B)  1:3  ratios 
corresponding  to  the  SDS-PAGE  in  Figure  4.1 3.  (C)  Size  distribution  histogram  of 
N10C/E61C  mixed  with  E61C  in  3:1  ratio. 
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4.8.1  Cross-linking  of  N10C/E61C 

As  demonstrated  in  Sections  4.6.2  and  4.6.3,  the  artificially  inserted  Cys  residues  can  induce 
the  disulfide  linkage  and  formation  of  tubes,  however,  as  observed  from  TEM  (Figure 
4.14A),  there  are  single  rings  present  along  with  the  nanotubes,  indicating  dissociation  of 
rings  from  the  tubular  structure  or  not  being  incorporated  into  the  tube.  This  could  be  caused 
by  the  local  surface  effects  created  as  the  sample  was  being  deposited  on  the  grid  surface.  To 
ensure  a  more  rigid  and  stable  structure,  other  chemical  linkages  were  tested  to  enhance  the 
formation  of  higher  molecular  weight  species. 

Cross-linking  using  reactive  reagents  was  tested  to  control  the  polymerisation  of  N10C/E61C. 
Homo-bifunctional  crosslinkers  have  been  used  to  determine  the  arrangement  of  subunits 
with  homo-oligomeric  proteins  9,  and  to  study  the  conformational  changes  in  proteins.  Two 
types  of  cross-linkers  were  trialled  to  observe  the  assembly  of  N10C/E61C  in  their  presence. 

Bismaleimides  are  a  class  of  compounds  with  two  maleimide  groups  connected  by  the 
nitrogen  atom  via  a  carbon  linker  (Figure  4.17A)  and  are  commonly  used  as  cross-linking 
reagents  in  polymer  chemistry  and  to  characterise  protein  structures  and  interactions  10.  In 
this  study,  bis(maleimide)ethane  (BMOE)  is  used  as  a  cross-linker,  containing  a  two  carbon 
linker,  to  specifically  react  with  the  engineered  sulfhydryl  groups  on  the  surface  of 


Figure  4.17\  Structures  of  (A)  bis(maleimido)ethane  and  (B)  glutaraldehyde. 
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N10C/E61C.  This  is  an  alternative  method  to  form  stable  tubes  via  the  thiol-maleimide 
chemistry. 

Reaction  of  a  sulfhydryl  to  the  maleimide  group  results  in  the  formation  of  a  stable  thioether 
linkage,  which  cannot  be  cleaved  by  reducing  agents  or  physiological  buffer  conditions  n. 
Figure  4.18A  shows  the  thiol-maleimide  reaction.  The  reaction  of  maleimides  is  very 
specific  to  sulfhydryls  at  pH  6.5-7. 5  and  they  do  not  react  with  Tyr,  His  or  Met  residues  12. 
The  maleimide  moiety  is  temporarily  stable  in  aqueous  solutions  devoid  of  reactive 
sulfhydryl  targets,  but  hydrolysis  to  a  nonreactive  maleimide  acid  can  occur  during  storage  at 
pH  >  8.  Hydrolysis  of  the  ring  structure  can  also  occur  following  conjugation,  resulting  in  an 

1  Q 

open-ring  linkage  (Figure  4.18A). 

Glutaraldehyde  (GA)  is  a  bi-functional  cross-linker  containing  two  aldehyde  groups  on  a  five 
carbon  chain  (Figure  4.17B).  GA  is  most  commonly  used  in  enzyme  technology,  leather 
tanning,  and  as  a  fixative  for  microscopy  14.  In  this  study,  GA  was  used  to  link  the 
N10C/E61C  rings  into  an  irreversible  tube  form  and  further  serve  as  a  comparison  to  the 
maleimide  cross-linking  agent. 
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Figure  4.18\  (A)  Reaction  of  maleimide-activated  compounds  to  sulfhydryls.  Adapted  from 
Heredia  et  al.  16 .  (B)  Proposed  mechanism  of  pyridium  cross-links .  Adapted  from  Meade  et 
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GA  is  known  to  react  with  several  functional  groups  of  proteins,  including,  amine,  thiol, 
phenol  and  imidazole  moieties  15.  The  most  reactive  species  towards  GA  are  the  s-amino 
groups,  followed  by  the  a-amino  groups,  which  correspond  to  the  primary  amino  groups 
present  in  the  Lys  residues,  and  the  N-terminal  amino  group  present  in  proteins,  respectively 
14 .  Since,  N10C/E61C  also  contains  engineered  thiol  groups,  the  reaction  of  GA  via  thiol 
groups  also  has  to  be  taken  into  account.  The  mechanism  by  which  GA  reacts  with  the  amino 
groups  of  proteins  is  still  under  debate.  One  possible  mechanism  is  via  the  formation  of  a 
pyridinium  cross-link,  as  shown  in  Figure  4.18B. 

(i)  Sulfhydryl  cross-linking 

BMOE  was  used  to  link  the  N10C/E61C  ring  modules  through  their  exposed  Cys  residues. 
Since  N10C/E61C  instantaneously  forms  the  elongated  stacks,  it  was  reduced  by  dialysing  in 
a  phosphate  buffer,  pH  7.5,  containing  10  mM  TCEP  for  2  h  prior.  TCEP  was  chosen  as  it 
does  not  interfere  with  the  maleimide  chemistry.  The  reduced  N10C/E61C  (5  mg/mL)  was 
filtered  using  a  0.2  pm  syringe  filter  before  adding  BMOE.  Care  was  taken  to  make  fresh 
solutions  of  BMOE  to  avoid  hydrolysis.  The  mixture  was  incubated  for  2  h  and  the  reaction 
stopped  by  adding  50  mM  DTT  to  the  reaction  mix.  15  pL  aliquots  were  taken  and  loaded  on 
the  SDS-PAGE.  Figure  4.19A  shows  the  SDS-PAGE  of  N10C/E61C  incubated  with  BMOE, 
at  increasing  concentration.  Reduced  N10C/E6 1C  (Figure  4.19A,  lane  1)  exhibits  three  bands 
whereas  on  controlled  addition  of  the  cross-linker,  there  were  higher  molecular  weight 
species  present.  The  samples  investigated  with  SDS-PAGE  showed  that  N10C/E61C 
incubated  with  a  very  low  concentration  of  BMOE  (>  0.062  mM,  lane  7  and  lane  8)  exhibited 
formation  of  discrete  bands  indicating  cross-linking  of  the  subunits.  Whereas,  higher 
concentration  (>0.125  mM)  shows  smearing  on  the  gel  which  suggests  non-specific  cross- 
linking. 
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(ii)  Glutar aldehyde  cross-linking 

GA  is  a  homo-bifunctional  reagent  that  reacts  with  the  primary  amine  groups  i.e.  s-amino 
groups  of  Lys  residues,  capable  of  forming  inter-  and  intra-subunit  covalent  bonds. 
N10C/E61C  was  cross-linked  with  GA  to  compare  the  pattern  obtained  to  that  obtained  with 
the  use  of  BMOE.  The  sample  was  prepared  as  described  above  with  omission  of  the 
stopping  of  the  reaction  with  DTT.  The  SDS-PAGE  (Figure  4.19B)  showed  smears  for  all 
concentrations  of  GA  tested,  indicating  that  there  were  non-specific  cross-linking  events.  GA 
has  the  capacity  to  create  large  and  often  poorly  defined  protein  aggregates  and  is  difficult  to 
control  the  intra-  and  intermolecular  cross-linking  18.  Maleimide  chemistry  proves  to  be  more 
effective  in  cross-linking  N10C/E61C  than  using  glutaraldehyde  as  observed  from  the  extent 
and  nature  of  bands  formed  by  SDS-PAGE.  The  degree  of  linkage  is  more  controlled  in  the 
case  of  N10C/E61C  being  incubated  with  E61C. 
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Figure  4.19\  Non-reducing  SDS-PAGE.  (A)  N10C/E61C  with  BMOE  for  30  min  at  room 
temperature;  M-ladder;  1-20  mM;  2-10  mM;  3-1  mM;  4-0.5  mM;  5-0.25  mM;  6-0.125  mM; 
7-0.062  mM;  8-0.031  mM;  9-reduced  N10C/E61C.  (B)  N10C/E61C  with  GA  incubated  for  5 
min  at  room  temperature;  M-ladder;  1-  reduced  N10C/E61C;  2-0.025  mM;  3-0.5  mM;  4-1 
mM;  5-2  mM;  6-2.5  mM;  7-5  mM.  Arrows  indicate  the  monomeric  form  (-9  kDa),  tetrameric 
form  (-35  kDa)  and  the  higher  order  forms. 
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4.9  Summary 

Lsma  consists  of  two  chemically  distinct  faces  i.e.  the  helix  face  and  the  loop  face.  Two 
approaches  were  tested  to  manipulate  the  quaternary  structure  of  the  rings  and  let  them  self- 
assemble  into  stacks  of  rings.  The  muteins  that  were  engineered  to  include  Cys  residues 
polymerised  into  tubes  by  disulfide  conjugation.  In  the  case  of  N10C/E61C,  the  associations 
can  take  place  through  either  of  the  two  faces  i.e.  helix-to-loop,  loop-to-loop  and  helix-to- 
helix.  As  observed  from  the  size  distribution  histogram,  the  average  length  of  the 
spontaneously  formed  tubes  is  ~20  nm,  suggesting  that  polymerisation  includes  up  to  ~7 
rings  conjugated  in  an  extended  tubular  arrangement.  From  the  micrographs,  the  majority  of 
the  stacks  are  linear,  suggesting  axial  stacking  of  the  ring  scaffold.  A  minority  of  the  stacks 
are  skewed,  suggesting  random  association  of  rings  through  thiol-thiol  interactions. 

Incubation  of  N10C/E61C  with  E61C  provided  some  insight  into  the  orientation  of  stacking 
of  the  modified  Lsma  rings.  Since  only  the  loop  face  is  available  for  conjugation  for  E61C 
and  it  was  found  that  N10C/E61C  conjugates  with  E61C  (Figure  4.15),  these  observations 
lead  to  the  conclusion  that  the  majority  of  the  linkages  were  in  a  head-to-tail  manner.  E61C 
presumably  acts  as  a  chain  terminating  module  at  one  end.  In  order  to  comment  on  the  exact 
nature  of  the  Lsma  tube  structure,  high  resolution  structural  data  is  required. 

The  results  are  consistent  with  the  tubes  being  formed  by  direct  disulfide  links  between  the 
engineered  Cys  side  chains.  From  the  solution  studies  performed  on  N10C/E61C,  the  tubes 
completely  dissociate  on  addition  of  a  reducing  agent,  thereby  confirming  that  the 
associations  are  mediated  through  disulfide  bridge  formation.  Considering  the  fact  that  there 
are  seven  Cys  residues  on  both  faces  of  the  Lsma  ring,  a  skewed  non-axial  linking  of  the 
rings  cannot  be  eliminated.  As  seen  from  the  TEM  micrograph  (Figure  4.14A),  some  of  the 
tubes  are  non-axially  stacked.  This  may  arise  due  to  incomplete  alignment  of  the  rings  on  top 
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of  each  other  but  resulting  in  disulfide  conjugation  nonetheless.  The  schematic  in  Figure  4.4 
sums  up  the  various  arrangements  available  to  the  Cys  modified  Lsma  rings. 

By  generating  a  synthetic  interface  between  the  ring  modules,  high  aspect-ratio  tubes  were 
created,  otherwise  not  observed  in  the  native  state  of  Lsma.  The  nature  of  these  associations 
is  shown  to  be  through  disulfide  bridge  formation.  This  feature  provides  an  additional  control 
over  the  assembly  and  disassembly  of  the  proteinaceous  tubes. 

The  various  interactions  present  and  observed  in  Lsm  proteins  which  result  in  the  formation 
of  higher  oligomeric  structures  of  stacks  and  fibres  include  salt  bridge  formation  19,  His 
interactions  19;  20 ,  disulfide  linking  2,  and  electrostatic  interactions  20;  21 .  These  interactions 
result  in  either  stacking  of  the  ring  modules  to  form  14-mer  species  or  result  in  assembly  of 
fibril  ultrastructures.  As  mentioned  in  Section  4.2,  higher  order  oligomerisation  properties 
have  been  observed  for  other  archaeal  Lsm  proteins.  Mt  SmAPl  (PDB  ID1LOJ)  also  exhibits 
oligomerisation  properties  assembling  into  fibres  stretching  to  several  nanometres  in  length. 
These  were  obtained  by  treating  purified  protein  in  very  low  salt  conditions  which  resulted  in 
head-to-tail  stacking  of  the  heptamers  in  low  ionic  strength  buffer  2.  The  same  conditions 
were  used  to  induce  fibrogenesis  in  the  case  of  Lsma  (Section  4.2).  It  is  interesting  to  point 
out  the  difference  in  the  preparation  of  the  proteins.  Mt  SmAPl  was  constructed  as  a  C- 
terminally  His6-tagged  protein,  which  was  followed  by  trypsin  cleaving  to  remove  the  tag. 
The  N-terminus  of  the  protein  did  not  have  any  extra  residues  post-cleavage.  On  the  other 
hand,  the  strategy  used  in  this  study  involved  using  GST  as  the  affinity  tag,  which  results  in 
the  incorporation  of  two  extra  residues,  Gly  and  Ser,  at  the  N-terminus  after  protease 
cleavage.  The  fact  that  protein  preparations  had  the  extra  residues  could  contribute  to  it 
failing  to  produce  fibres  by  head-to-tail  stacking  when  treated  in  low  ionic  strength  buffers. 
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Although  various  proteins  exist  that  naturally  assemble  into  filaments  and  tubular  structures, 
these  are  not  optimal  for  nanotechnological  applications  22:  23.  The  features  required  for 
downstream  applications  include  ease  of  modification,  in  vitro  control  over  assembly,  and 
easily  accessible  structural  features.  Doughnut-shaped  proteins  allow  in  vitro  assembly  post¬ 
expression  of  the  self-assembling  ring  module.  As  discussed  in  Section  4.3,  other  ring-shaped 
proteins  have  been  engineered  to  generate  tubular  structures,  with  pore  dimension  ranging 
from  13  nm  to  3  nm  ’  .  Lsma  tubes  offers  a  pore  size  of  ~  1.5  nm  which  is  a  new  dimension 
reached  compared  to  other  the  protein  tubes  generated  by  genetic  engineering  methods. 
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5  Chapter  five 

Preparation  and  biophysical  characterisation  of  complexes 
of  HiS6-tagged  Lsma:  evidence  for  higher  order 
organisation 

5.1  Introduction 

It  is  useful  to  trigger  a  change  in  the  quaternary  structure  with  metals  or  to  use  a  biological 
structure  to  template  metal  assembly.  Previous  studies  on  using  biomolecules  for 
metallisation  have  yielded  components  for  electronic  circuits  1;  2,  templates  to  create  NPs  3’  4 
and  nano  wires  5.  Much  more  recently,  another  doughnut-shaped  protein  system, 
peroxiredoxin,  has  been  used  to  assemble  Au  NPs  into  one  dimensional  space  6. 

As  a  purification  strategy,  a  poly-His  tag  was  employed  as  an  affinity  tag.  This  approach 
served  a  dual  advantage.  Along  with  purifying  the  protein,  the  tag  conferred  a  multivalent 
display  of  functional  groups  for  each  of  the  Lsma  monomers.  The  ready  access  to  imidazole 
moieties  with  their  affinity  for  metal  ions,  was  used  to  generate  higher  order  architectures  for 
the  Lsma  system.  This  chapter  describes  the  effects  of  attachment  of  a  His6-tag  to  the  termini 
of  Lsma  polypeptides  and  their  resulting  assembly  into  higher  molecular  weight  species. 
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5.2  Recombinant  design  and  production  of  His6-tagged  Lsma 

The  P-propeller  of  the  Lsma  assembly  forms  a  robust  core,  leaving  both  the  N-  and  C-termini 
of  each  monomer  exposed  and  available  for  modification  (as  discussed  in  Section  1.1).  A 
consideration  of  the  Lsma  heptamer  structure  in  Figure  5.1B,  shows  the  N-terminal  residues 
as  seven  flexible  segments  preceding  the  a-helices  circling  one  face  of  the  ring.  While  the 
Lsma  C-terminal  segments  are  not  specifically  defined  in  the  crystal  structures,  it  is  likely 
that  these  extend  out  from  the  equatorial  locations  around  the  ring,  i.e.  remote  from  the 
helical  face  (Figure  5.1B).  Therefore,  the  ring  scaffold  provides  two  positions  for  the 
placement  of  His-rich  segments  in  each  of  the  seven  monomeric  units  of  Lsma  heptamer 
(Figure  5.1A).  In  order  to  assess  the  properties  conferred  to  the  Lsma  scaffold  by  addition  of 
the  His6-tag  at  each  position,  constructs  were  generated  by  cloning  into  plasmids  that 
incorporated  His6-tags  at  the  N-terminus  and  C-terminus  of  the  protein. 

Two  plasmids  designed  to  express  the  fusion  constructs,  [H6Lsma]7  and  [LsmaH6]7  were 
generated  by  cloning  the  Lsma  gene  into  plasmid  pET24a,  ordered  from  Epoch  Biolabs 
(Section  2.2.3).  These  were  used  to  transform  the  BL21(DE3)  strain  of  E.  coli  (Section 
2.4.1),  Small-scale  production  was  used  to  assess  the  relative  levels  of  protein  expression 
across  several  growth  protocols  with  variation  in  media,  expression  strain  and  induction 
protocols  (Section  2.4.2),  The  best  yields  were  obtained  by  expressing  the  protein  at  an 
OD600  of  0.6  by  IPTG  (0.2  mM)  and  grown  further  for  6  h  at  26  °C.  SDS-PAGE  analysis  was 
used  to  verify  the  optimum  growth  conditions  for  the  two  fusion  proteins  (Figure  5.2).  MRE5 
cells  did  not  show  any  expression  of  the  His6-tagged  Lsma  proteins  under  the  conditions 
tested  in  the  small  scale  expression  trials  (Section  2.4.2),  and  were  not  pursued  as  an 
expression  strain.  Figure  5.2C  shows  the  total  protein  content  in  the  MRE5  cells  after  protein 
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Figure  5.1 1  Sequence  and  ribbon  diagram  of  Lsma  (PDB  ID  1181)  depicting  the  placement  of  His  <5  segments.  (A)  Lsma  sequence  showing  His  6 
segment  at  the  N-  and  C- termini.  Dots  between  the  sequences  represent  omitted  residues.  (B)  Ribbon  representation  of  the  Lsma  fold  with  f- 
strands  labelled.  The  overall  placement  of  the  His6  segment  is  also  shown  after  self-assembly  into  a  ring  scaffold.  Images  were  prepared  in 

y 

Pymol  . 
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Figure  5.2 1  SDS-PAGE  showing  protein  composition  for  cells  containing  Lsma  constructs.  Protein  content  analysed  following  growth  of  hosts 
for  (A)  [H^Lsma]  7  in  BL21(DE3)  cells,  (B)  [LsmaH6]7  in  BL21(DE3)  cells  and  (C)  expression  in  MRE5  strain  showing  the  absence  of  His 5- 
tagged  Lsma  with  1PTG  induction  in  the  crude,  grown  at  26  °C.  The  gel  was  run  at  200  V  for  35  mins  at  room  temperature  before  staining  with 
Simply  blue  stain. 
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expression  was  induced  by  IPTG.  There  is  no  noticeable  expression  of  the  protein.  Due  to 
reasonable  levels  of  expression  in  the  soluble  fraction  in  the  BL21(DE3)  strain,  both  the 
constructs,  [H6Lsma]7  and  [LsmaH<5]7,  were  selected  for  further  large-scale  preparation. 

As  was  found  for  the  untagged  protein  (Section  3.7),  [HeLsma]-?  ran  as  multiple  bands  on  a 
denaturing  SDS-PAGE  gel,  at  10,  40  and  50  kDa  (Figure  5.2A),  consistent  with  the  wildtype 
Lsma  and  other  muteins  isolated.  A  separate  band  for  [LsmaHe]?  (Figure  5.2B),  at  ~60  kDa, 
is  present  both  in  the  insoluble  and  soluble  fractions,  and  is  suggestive  of  a  different  stable 
oligomer,  corresponding  to  a  hexamer,  formed  during  the  denaturation  step  compared  to 
[H6Lsma]7.  This  was  attributed  to  the  placement  of  the  His6-tag  at  the  C-terminal.  A  higher 
yield  was  observed  for  [H6Lsma]7  recovered  in  the  soluble  cell  fraction,  compared  to 
[LsmaH6]7  as  observed  from  the  SDS-PAGE  analysis. 

5.3  Isolation  of  tagged  recombinant  products 

The  matrix  most  commonly  used  for  immobilised  metal  affinity  chromatography  (IMAC)  of 
His-tagged  protein  incorporates  nickel  (Ni2+)  8;  9.  However,  other  divalent  transition  metal 
ions  have  also  been  tested  successfully  for  IMAC  procedures,  taking  advantage  of  the  affinity 
of  metal  ions  to  the  aromatic  imidazole  ring  of  the  His  residues  10.  The  metals  most  widely 
used  are  cobalt  (Co2+),  zinc  (Zn2+),  and  copper  (Cu2+)  8;  n.  Following  the  application  of  a 
protein  mixture  to  the  adsorption  bed  at  pH  8.0,  high  concentrations  of  imidazole  facilitated 
the  desorption  of  a  His-rich  target  from  the  matrix,  after  washing  off  unbound  proteins. 
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Figure  5.31  Elution  profile  for  IMAC  purification  of  [ H^Lsmo.] 7.  (A)  [ Hffsma] 7  was 
successfully  purified  using  a  chelating  matrix  pre-charged  with  CUSO4.  Similar  results  were 
obtained  for  [LsmaHf,l7-  (B)  HiTrap  Chelating  (G  E  Healthcare )  columns  were  pre-charged 
with  different  metal  ions  (Section  5.3)  and  proteins  eluted  at  a  flow  rate  of  1  mL/min.  The 
trace  indicates  matrix  charged  with  Ni2+.  Traces  are  A280  (black)  and  conductance  (red). 
Binding,  washing  and  elution  steps  are  indicated  at  the  top  of  the  IMAC  trace.  The  SDS- 
PAGE  gel  shows  molecular  weight  standards  with  crude,  flowthrough  and  eluted  fractions. 
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Samples  of  [HgLsma]-?  and  [LsmaH^]-?  were  successfully  recovered  by  selectively  using  a  Cu- 
based  matrix.  Figure  5.3A,  shows  the  elution  profile  of  [H6Lsma]7  at  450  mM  of  imidazole, 
yielding  ~20  mg/mL  of  protein.  Similar  results  were  obtained  for  [LsmaH6]7.  Purification  of 
the  constructs  failed  when  metals  such  as  N1CI2,  ZnCl2  and  C0SO4  were  trialled  (Figure 
5.3B).  Once  purified,  the  samples  were  subjected  to  preparative  scale  SEC  in  order  to  assess 
the  species  distribution  in  the  samples. 

5.3.1  [H6Lsma]7  is  selective  for  Cu2+ 

To  understand  the  reason  for  selective  affinity  of  [HeLsma]7  and  [LsmaH(l]7  to  the  Cu2+- 
matrix,  it  is  imperative  to  know  the  chemistry  of  the  metal-ligand  binding  to  the  matrix  used 
and  its  interaction  with  the  accessible  His  residues  from  the  His6-tag.  Loading  different  metal 
ions  onto  a  resin,  results  in  variable  affinity  and  specificity  for  His6-tagged  proteins. 
Generally,  cobalt  exhibits  the  highest  binding  specificity  of  the  commonly  used  IMAC  metal 
ions.  Copper  lies  at  the  other  end  of  the  spectrum  and  has  a  high  affinity  leading  to  higher 
yields  but  nonspecific  binding  10.  Nickel  lies  in  the  middle  of  the  spectrum  and  provides 
specific  binding  and  reasonable  yields.  Although  IMAC  procedures  have  been  widely 
established,  self-assembling  proteins  can  pose  a  unique  challenge  due  to  the  number  of 
polyHis-tags  on  one  assembled  protein  structure  12  and  their  interaction  with  the  cation- 
charged  matrix. 

The  coordination  potential  of  a  metal  ion  is  largely  dependent  on  the  localisation  and 
topography  of  the  His  residues  exposed  on  the  protein  surface  8.  Moreover,  the  spatial 
distribution  of  the  His  residues  plays  a  critical  role  on  the  retention  capacity  of  a  metal-ion 
matrix  to  adsorb  the  protein  of  interest  13.  The  partial  burial  of  His  residues  due  to  the 
quaternary  structure  of  Lsma  and  possible  oligomerisation  in  the  matrix  itself,  may  limit  the 
strength  of  coordination.  Under  the  given  conditions  of  pH  and  salt,  the  His6-tagged  Lsma 
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Figure  5.41  Schematic  diagram  of  chromatographic  media  used  for  affinity  purification  of 
recombinant  proteins.  Adapted  from  Block  el  al 14 .  Cu2+-IDA  is  iminodiacetic  acid  forming  a 
complex  with  copper  ions. 

constructs  showed  optimum  binding  to  the  Cu2+-matrix.  Failure  of  binding  and  retention  in 
the  case  of  Ni2+,  Co2+,  and  Zn2+  can  be  attributed  to  the  formation  of  a  relatively  weak  ternary 
complex  formed  between  the  chelate,  metal  ion  and  the  protein.  Figure  5.4  shows  the 
complex  formed  by  the  His6-tagged  protein,  the  ligand  and  the  matrix. 

Such  behaviour  also  provides  insights  into  the  accessibility  of  the  His6-tags.  Failure  to 
capture  this  recombinant  protein  by  other  metal  ions  suggests  that  the  His6-tags  are  not 
entirely  exposed  to  the  surface  and  hence  not  accessible  for  binding  to  the  cation.  This 
observation  is  also  indicative  of  possible  metal-assisted  oligomerisation  of  the  quarternary 
structure  of  the  His6-tagged  Lsma  proteins  during  the  absorption  step  of  the  procedure 
(Figure  5.5).  The  differential  interactions  of  M2+-IDA  with  the  His6-tagged  Lsma  constructs 
in  the  IMAC  procedure  demonstrates  the  metal-binding  capacity  of  this  ring  scaffold  and 
provides  tantalizing  insights  for  further  work  in  the  field  of  metal-assisted  assembly  of  the 
Lsma  complex  and  other  protein  complexes  15, 16 . 
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Figure  5.51  Schematic  of  putative  mechanism  of  formation  of  metal-bound  [Hffsma]  7.  (A)  Possible  assembly  steps  and  interaction  with  copper 
ions  during  affinity  chromatography.  (B)  Intermolecular  metal-to-ligand  coordination:  metal  complexing  material  is  a  polymer  support 
derivatised  with  Cu2+ -iminodiacetate  (G  E  Healthcare ).  Given  the  multiplicity  of  the  His6-tags  oriented  towards  one  surface  of  the  recombinant 
Lsma,  a  microenvironment  was  created  at  the  attachment  point  of  the  cation  and  protein,  leading  to  the  formation  of  a  stronger  coordination 
bond  and  hence  a  strong  chelate  complex. 


152 


Chapter  five:  Biophysical  characterisation  of  His6-tagged  Lsma 


5.4  Solution  properties  of  His6-tagged  Lsma 

SEC  was  used  to  assess  the  distribution  of  oligomeric  species  for  the  His6-tagged  constructs 
using  methods  described  in  the  previous  chapter.  Lsma  prepared  as  a  GST-fusion  protein  was 
characterised  using  a  Superdex  75  matrix  which  was  consistent  with  heptameric  assembly  17 , 
in  agreement  with  SEC-SLS  results  in  Section  3.9.1.  A  similar  strategy  was  employed  to 
determine  the  octameric  organisation  of  Lsm3  18.  Therefore,  after  affinity  chromatography, 
both  [H6Lsma]7  and  [LsmaEbb  were  subjected  to  preparative  and  analytical  grade  SEC. 

Preparative  SEC  for  [H6Lsma]7  revealed  a  bimodal  distribution  of  species  (Figure  5.6A). 
Peak  1  eluted  at  the  void  volume  (V0),  indicating  a  species  of  >1300  kDa,  possibly  an 
aggregate  formed  during  the  affinity  purification.  Peak  2  eluted  within  the  sizing  range  of  the 
column  and  corresponded  to  a  size  of  400-500  kDa.  Fraction  II  was  collected  and  re¬ 
chromatographed  on  an  analytical  SEC  column.  [H6Lsma]7  eluted  as  a  single  symmetrical 
peak  spanning  Kav  0.2-0. 3  i.e.  corresponding  to  a  native  mass  of  -440  kDa.  The  estimated 
molecular  weight  was  consistent  with  a  species  containing  six  heptameric  rings  which  have  a 
predicted  mass  of  460  kDa.  The  [H^Lsma]?  sample  was  stable  for  several  weeks  stored  at  4 
°C  as  assessed  by  SEC.  By  SDS-PAGE,  a  pure  preparation  of  [H6Lsma]7  ran  as  three  bands 
at  10,  50,  80  kDa  (Figure  5.6B).  As  seen  from  the  gel,  the  relatively  minor  species  ran  at  an 
expected  size  of  1 1  kDa.  The  other  two  pronounced  bands  running  at  -50  kDa  and  -80  kDa, 
can  be  attributed  to  formation  of  stable  oligomers  during  the  denaturation  step.  This  triplet  of 
bands  is  diagnostic  for  His6-tagged  Lsma  assembly  and  was  also  noted  above  during 
purification  (Section  5.3).  Thus,  [H6Lsma]7  exhibited  similar  characteristics  consistent  with 
the  robust  and  stable  properties  of  the  Lsma  heptameric  ring  scaffold  formed,  at  the  same 
time,  exhibiting  higher  order  species  formation,  not  previously  observed  for  Lsma. 
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Figure  5.6\  SEC  chromatograph  of  [H^Lsma]  7.  (A)  Preparative  grade  SEC  (Superdex  200 
16/600 )  column  pre-equilibrated  with  20  mM  Tris  pH  8.0,  150  mM  NaCl,  2%  glycerol  with 
a  flow  rate  of  1  mL/min.  IMAC  elution  profile  provided  in  inset.  (B)  Analytical  SEC 
( Superdex  200  10/300)  column,  in  the  same  buffer  as  above  with  a  flow  rate  of  0.4  mL/min. 
(C)  SDS-PAGE  of  purified  [ H6Lsma]7 . 
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Figure  5.7 1  SEC  chromatograph  of  [ LsmaH 6] 7.  (A)  Preparative  grade  SEC  (Superdex  200 
16/600)  column  pre- equilibrated  with  20  mM  Tris  pH  8.0,  150  mMNaCl,  2%  glycerol  with  a 
flow  rate  of  1  mUmin.  1MAC  elution  profile  provided  in  inset.  (B)  Analytical  SEC  (Superdex 
200  10/300),  in  the  same  buffer  as  above  with  a  flow  rate  of  0.4  mL/min.  (C)  SDS-PAGE  of 
[LsmaH 7. 
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[LsmaH6]7  exhibited  a  different  SEC  profile  compared  to  [H6Lsma]7.  On  the  preparative 
SEC,  it  eluted  as  a  single  species  with  negligible  material  eluting  in  the  void  volume  (Figure 
5.7A),  in  contrast  to  the  bimodal  distribution  obtained  for  [H6Lsma]7.  The  calculated  Kav  of 
0.43  corresponds  to  ~  50  kDa.  This  fraction  separated  as  two  species  when  injected  onto  an 
analytical  sizing  column  in  Tris/NaCl  pH  8.0  buffer  (Figure  5.7B).  The  majority  formed 
aggregates  eluting  at  the  void  volume,  whereas  the  second  fraction  had  a  Kav  of  0.45 
corresponding  to  ~66  kDa,  indicative  of  a  single  heptameric  ring.  Characterisation  of  this 
fraction  proved  to  be  difficult  as  it  precipitated  within  a  few  hours  at  4  °C.  Therefore,  due  to 
the  unstable  behaviour  of  [LsmaH6]  compared  to  [H6Lsma]7,  the  latter  was  chosen  for  further 
study. 

From  the  solution  studies  of  the  two  His6-tagged  constructs,  the  position  of  the  tag  plays  a 
critical  role  in  the  multimerisation  of  the  Lsma  rings  to  higher  order  species.  For  [H6Lsma]7 , 
the  His6-moiety  is  predicted  to  be  oriented  towards  one  face  of  the  ring  (Section  5.1), 
whereas,  for  [LsmaH6]7,  since  the  tags  are  directed  at  the  equatorial  plane  of  the  ring,  the 
orientation  of  the  His  residues  may  not  be  conducive  for  interaction  with  metal  ions.  It  is 
difficult  to  predict  whether  [LsmaH(l]7  assemblies  are  metal  directed  due  to  the  unstable 
nature  of  the  sample  and  lack  of  subsequent  characterisation. 

Six  to  seven  [H6Lsma]7  rings  form  a  discrete  assembled  structure.  This  species  is  in  contrast 
to  the  single-ring  species  obtained  by  engineering  Lsma  as  a  GST-fusion.  Thus,  the  inclusion 
of  a  His6-tag  in  the  native  sequence  of  Lsma  provides  a  new  level  of  control  for  the 
hierarchical  organisation  of  its  quaternary  structure. 
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5.5  Visualisation  of  [H6Lsma]7  and  [LsmaH6]7  by  TEM 

[H6Lsma]7  oligomerised  into  a  defined  cluster  of  individual  rings  as  ascertained  by  negative 
stain  transmission  electron  microscopy.  The  cluster  had  a  diameter  of  about  ~9  nm  (Figure 
5.8).  In  most  of  the  particles,  single  rings  with  pores  were  visible.  It  is  evident  from  the 
micrographs,  that  individual  [H(,Lsma]7  rings  are  arranged  in  a  cage-like  architecture.  Such 
an  arrangement  is  suggestive  of  a  hollow  cavity,  formed  by  an  angular  arrangement  of  the 
rings.  These  dimensions  are  much  larger  than  the  single  His6-Lsma  rings  (6.5  nm  wide  with 
central  pore  of  1.5  nm)  as  well  for  other  Lsm  proteins  such  as  Lsm3  octamer  18  (7.5  nm  wide 
and  2  nm  pore).  Therefore,  the  insertion  of  the  His6-tag  at  the  N-terminal  of  the  protein 
sequence  triggered  the  association  of  Lsma  rings  into  a  higher  ordered  organisation  of  a 
defined  shape  and  dimension. 

[LsmaH6]7  on  the  other  hand,  did  not  exhibit  any  definite  assemblies  as  observed  by  TEM 
(Figure  5.9).  Although,  single  rings  were  visible,  the  overall  sample  was  heterogenous  and 
nature  of  the  sample  indicated  aggregate  formation.  Due  to  lack  of  any  discrete  and  uniform 
organisation  of  [LsmaH6]7  rings,  this  construct  was  not  characterised  further. 

Single  particle  averaging  was  attempted  on  [H6Lsma]7  using  the  EMAN  suite.  Due  to  the 
heterogeneous  nature  of  the  sample  and  high  contrast  due  to  negative  staining,  the  obtained 
averaged  image  was  not  useful. 
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Diameter  [H  Lsma]7  (nm) 


Figure  5.51  Negative  stain  TEM  of  [H^Lsma] y.  (A)  Micrographs  showing  well-distributed 
distinct  clusters  of  [Hffsma] 7  (0.05  mg/mL)  prepared  in  Tris/NaCl  buffer.  (B)  Size 
distribution  histogram  of  [H^Lsma]  7.  (C)  Magnified  particles  from  the  original  micrographs. 
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Figure  5.9\  TEM  of  [LsmoiH6]7.  Single  rings  are  visible  but  the  sample  is  heterogeneous  in 
nature. 


Thus,  TEM  and  analytical  SEC  analysis  revealed  that  the  [H6Lsma]7  complex  obtained  is 
comprised  of  about  six  individual  rings  associated  through  contacts  mediated  by  the  His 
residues.  Although  His-tags  do  not  cause  structural  changes  in  most  proteins  19,  there  have 
been  examples  where  inclusion  of  a  His-tag  in  the  native  sequence  has  caused  changes  in 
disulfide  bonding  patterns  20  and  conformational  changes  at  a  binding  site  21 .  Metal-assisted 
oligomerisation  of  His-tagged  proteins  has  been  seen  in  Pbx  protein  fused  with  a  Hisio-tag 
but  the  exact  nature  of  the  oligomer  is  not  known  15.  Only  recently  there  have  been  reports  of 
rationally  engineered  single  His  and  Cys  residues  in  a  protein  sequence  to  artificially  induce 
metal-mediated  assembly  of  proteins.  New  crystal  forms  were  produced  by  engineering  metal 
contact  points  between  protein  interfaces  for  T4  lyzozyme  16.  This  approach  is  also  being 
used  to  develop  novel  protein  nanostructures  and  assemblies.22 
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5.6  Polydispersity  of  [H6Lsma]7 

Although  [H6Lsma]7  eluted  as  a  single  species  on  the  analytical  SEC,  the  failure  to  produce 
an  averaged  structure  of  the  TEM  images  suggested  that  the  structures  formed  were 
polydisperse.  In  order  to  assess  the  polydispersity  of  the  complex,  SEC  was  coupled  with 
static  light  scattering  (SEC-SLS).  This  technique  provides  a  direct  measure  of  the  molecular 
weight  from  the  change  in  the  light  scattering  based  on  the  Debye-Zimm  equation  23  (Section 
2.6.5).  [H6Lsma]7  samples  eluted  from  the  SEC  column,  were  directly  passed  through  a  right 
angle  light  scattering  detector  to  ascertain  the  molecular  weight  of  each  point.  This  measure 
of  molecular  weight  was  not  constant  over  the  peak  as  it  trails  down  over  the  elution  profile 
(Figure  5.10).  The  molecular  weight  range  measured  at  each  point  is  400  to  500  kDa, 
indicating  that  [H6Lsma]7  consists  of  five  to  seven  rings  forming  a  complex.  Attempts  to 
generate  a  more  monodisperse  preparation  of  [H6Lsma]7  were  trialled  by  employing  different 
gel  filtration  matrices  (Superose  12  and  Sephacryl  400)  to  separate  the  five-membered 
complex  from  the  six-  and  seven-membered  complexes,  but  efficient  separation  could  not  be 
achieved.  Since  Lsma  is  thermostable,  it  was  hypothesised  that  heating  it  would  encourage 
the  formation  of  a  stable  homogenous  product  by  rearrangement  of  the  complex,  but  again 
such  a  strategy  did  not  succeed  in  generating  a  more  homogenous  state.  The  purified 
[H6Lsma]7  sample  was  heated  upto  60  °C,  70  °C  and  80  °C,  and  analysed  by  analytical  SEC. 
The  results  were  similar  to  that  obtained  for  the  non-heated  sample,  hence  failed  to  resolve 
the  [H6Lsma]7  mixture  (data  not  shown). 
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Retention  time  (min) 

Figure  5.10\  Static  light  scattering  showing  the  molecular  weight  distribution  of  [H^Lsma] 7. 
The  run  was  operated  in  20  mM  Tris,  pH  8.0,  150  mM  NaCl,  2%  glycerol  at  a  flow  rate  of  0.4 
mL/min. 

5.7  Factors  affecting  the  assembly  of  [H6Lsma]7 

In  order  to  probe  the  factors  that  control  assembly  of  [H6Lsma]7  rings,  several  sets  of  solution 
conditions  were  tested.  Previous  studies  on  Lsm  proteins  have  indicated  that  the  ionic 
strength  of  the  buffer  influences  the  oligomerisation  state  of  Lsm[4+1]  polyprotein  24 .  pH 
plays  a  role  in  stacking  events  of  Lsm3,  with  low  pH  encouraging  formation  of  higher 
molecular  weight  species  and  assembly  of  multiple  octameric  rings  25 .  As  noted  in  Section 
1.9,  head-to-tail  stacking  events  have  been  observed  in  the  case  of  untagged  Lsma  (PDB  ID 
ILOJ)  thought  to  be  facilitated  by  the  electrostatic  interactions  of  the  loop4  and  the  helical 
face.  Therefore,  in  order  to  understand  the  nature  of  [H6Lsma]7  assembly,  solution 
characterisation  with  varying  pH  and  salt  concentrations  was  carried  out. 
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The  oligomeric  assembly  of  [H6Lsma]7  was  found  to  be  influenced  by  pH.  This  behaviour 
was  seen  by  aliquoting  the  affinity-purified  samples  into  phosphate  buffer  at  pH  7.0  and  Tris 
buffer  at  pH  8.0  and  pH  9.0.  The  salt  concentration  was  kept  constant  at  150  mM  NaCl.  At 
pH  8.0,  [HfiLsma]?  was  observed  to  elute  solely  as  a  single  species  (~  440  kDa).  However, 
when  the  same  affinity-purified  protein  was  applied  to  the  sizing  column  at  pH  7.0,  the 
majority  eluted  near  the  void  volume,  indicating  aggregation.  This  could  be  attributed  to  the 
pi  of  His6-Lsma  being  6.5  and  nearing  this  value  could  trigger  aggregation.  At  pH  9.0,  the 
assembly  seems  to  be  reorganised  into  three  species  with  the  dominant  fraction  eluting  at  Kav 
0.36  corresponding  to  -300  kDa.  At  pH  9.0  and  above,  the  imidazole  ring  is  deprotonated 
which,  as  shown  later  in  the  chapter,  is  predicted  to  weaken  the  interactions  between  the 
metal  ion  and  the  His  residues.  The  behaviour  of  [H6Lsma]7  at  high  pH  could  also  be 
explained  by  ionisation  of  the  pyrrole  hydrogen  of  the  His  side-chain  which  may  cause 
perturbation  in  the  already  formed  assembly  and  cause  it  to  rearrange  to  a  more  stable 
complex  at  high  pH  28 . 

Affinity-purified  samples  of  [H(,Lsma]7  were  also  dialysed  into  buffers  with  different  ionic 
strength  (100-800  mM  NaCl).  These  conditions  did  not  alter  the  [H6Lsma]7  organisation, 
thus,  ruling  out  electrostatically-mediated  interactions  between  the  rings. 
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B 


pH  7.0 


pH  8.0 


pH  9.0 


Figure  5.771  SEC  traces  of  [H^Lsma]  7  at  different  pH  carried  out  using  a  Superdex  200 
10/300  column  at  a  flow  rate  of  0.4  mL/min. 
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5.8  [H6Lsma]7  is  Cu2+  bound 

As  described  in  Section  5.3,  [H6Lsma]7  was  prepared  using  a  Cu2+-matrix.  During  the 
purification  procedure,  the  soluble  lysate  was  exposed  to  this  matrix  for  1  h,  allowing  for 
maximum  product  adsorption.  This  may  have  resulted  in  metal  leaching  from  the  IMAC 
adsorbant,  forming  a  chelation  product  with  the  recombinant  protein.  To  test  this  hypothesis, 
[H6Lsma]7  samples  were  washed  with  competing  chelating  agent,  EDTA,  to  ensure  an  apo 
(metal-free)  form  of  the  complex.  Figure  5.12,  shows  a  time  course  of  SEC  chromatograms 
of  [H6Lsma]7  material  treated  over  48  hours.  This  experiment  opens  up  the  possibility  of 
switchable  conversion  from  a  large  complex  to  a  smaller  species  by  removal  of  bound  metal. 

(i)  Size-distribution  at  time  4  h 

[H6Lsma]7  was  treated  with  a  molar  excess  of  EDTA  (10  mM).  Figure  5.12A  shows  the 
elution  profile  of  this  sample  taken  after  4  h  of  exposure  to  the  chelating  agent.  It  showed  a 
bimodal  distribution  with  fraction  I  at  Kav  0.2  (~  400  kDa)  and  fraction  II  at  0.4  (~  70  kDa). 
These  two  fractions  corresponded  to  the  six-ring  membered  [H6Lsma]7  assembly  and  single 
ring  form,  respectively.  The  isolated  sample  from  fraction  II,  when  re-chromatographed  in 
the  sizing  column,  did  not  rearrange  to  form  other  oligomeric  forms,  consistent  with  the 
assembly  process  being  metal  dependent.  In  order  to  drive  the  equilibrium  towards  smaller 
species,  a  more  extended  treatment  of  EDTA  was  employed. 
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Figure  5.12\  SEC  profiles  of  [H^Lsma] y  in  presence  ofEDTA  carried  out  in  Superdex  200 
16/600  with  a  flow  rate  of  1  mL/min.  (A)  10  mM  EDTA,  4  h.  (B)  15  mM  EDT A,  24  h.  (C)  15 
mM  EDTA,  48  h.  Dashed  lines  indicate  position  of  the  [H^Lsma]  7  complex  and  single  ring. 
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(ii)  Size-distribution  at  time  24  h 

The  percentage  of  metal-free  single  ring  form  of  [H6Lsma]7  increased  with  an  increase  in 
concentration  of  EDTA  and  extending  the  contact  time  to  24  h.  From  Figure  5.12B,  fraction  I 
shifted  its  elution  volume  to  Kav  0.23,  indicating  further  dissociation  into  smaller  species 
comprising  of  roughly  four  rings.  There  was  also  formation  of  aggregates  as  seen  in  the  void 
volume. 

(iii)  Size-distribution  at  time  48  h 

The  SEC  analysis  revealed  a  highly  mixed  population  after  48  h  of  incubation  with  EDTA 
(Figure  5.12C).  There  was  a  marked  increase  in  the  formation  of  the  single-ring  apo  (metal- 
free)  form  of  [H6Lsma]7.  Due  to  the  long  duration  of  treatment  there  was  more  aggregate 
formation  as  seen  from  the  fraction  eluting  at  the  void  volume. 

The  next  step  was  to  explore  the  conditions  which  provide  a  more  efficient  switch  to 
transition  between  the  six-ring  membered  metal-bound  assembly  and  the  single-ring  metal- 
free  form. 

(iv)  Complete  dissociation  of  [H^LsmaJy  metal-bound  assembly 

As  seen  from  the  pH  studies  carried  out  on  [H6Lsma]7  (Section  5.7),  the  assembly 
dissociated  into  smaller  species  at  pH  9  and  above.  Therefore,  this  condition  was  applied  to 
the  affinity-purified  [H6Lsma]7  in  the  presence  of  20  mM  EDTA  for  26  h  at  4  °C.  A  complete 
dissociation  of  the  cage-like  assembly  took  place,  as  seen  from  the  SEC  trace  (Figure 
5.13A).  This  fraction  was  analysed  by  TEM,  which  also  showed  the  presence  of  single  rings 
(Figure  5.13B).  Thus,  this  condition  is  a  suitable  means  to  switch  from  a  Cu2+-bound 
oligomeric  assembly  to  a  metal-free  single-ring  form. 
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Table  5.11  Table  showing  percentage  fraction  of  the  species  formed  during  EDTA  treatment 
of  [H6Lsma]7. 

Condition3  Percentage  fraction11 


Single  ring  Intermediate  species  Metal-bound  [H6Lsma]7 


20  mM  EDTA,  26  h,  pH  9.0 

100 

- 

- 

15  mM  EDTA,  48  h,  pH  8.0 

54 

46 

- 

15  mM  EDTA,  24  h,  pH  8.0 

42 

58 

- 

10  mM  EDTA,  4  h,  pH  8.0 

28 

- 

72 

a  20  mM  Tris  pH  8.0,  150  mM  NaCl 
b  Area  under  curve  calculated  in  OriginPro  software  suite. 


(v)  Formation  of  metal-bound  [H6Lsma]7 

In  metal  affinity  chromatography,  the  metal  ion  is  not  directly  attached  to  the  polymer 
support  but  via  a  chelating  group,  which  in  turn  is  covalently  attached  to  the  polymer  (Figure 
5.5B).  Thus,  the  protein  engineered  with  metal-coordinating  groups  (affinity-groups)  adsorb 
onto  the  functionalised  polymer  for  affinity  separation.  The  molecular  mechanism  of  protein 
adsorption  to  the  derivatised  functional  polymer  surface  involves  protein-to-metal 
interactions.  Since  the  binding  energy  of  metal-to-ligand  is  larger  than  that  of  protein-to- 
metal  binding  29 ’ 30: 31 ,  the  IMAC  matrix  is  a  robust  and  efficient  system  for  recombinant 
protein  isolation.  A  special  case  arises  for  variants  with  multiple  surface  metal-coordinating 
groups  which  have  larger  binding  affinity.  This  causes  the  protein  to  simultaneously  interact 
with  multiple  metal  ions  on  the  derivatised  surface.  The  binding  selectivity  in  affinity 
chromatography  methodology  is  largely  influenced  by  the  multiplicity  and  microenvironment 
created  by  the  metal-binding  sites  (His  residues  and  deprotonated  amines)  exposed  on  the 
protein  surface  32;  33 . 


167 


Chapter  five:  Biophysical  characterisation  of  His6-tagged  Lsma 


In  [H6Lsma]7,  seven  affinity-segments  are  oriented  towards  one  side  of  the  ring  scaffold,  with 
six  His  side-chains  on  each  segment,  serving  as  potential  sites  to  interact  with  metal  ions. 
Strong  binding  of  [H6Lsma]7  is  achieved  by  multi-point  attachment  of  these  residues.  This 
complex  interaction  leads  to  the  formation  of  a  very  strong  protein-Cu2+-chelate  complex, 
greater  than  the  Cu2+-IDA  interaction.  Therefore,  it  is  plausible  that  [H6Lsma]7  is  formed  in 
the  microenvironment  created  by  the  ligand-metal-protein  complex  and  the  assembly  thus 
formed  leaches  Cu2+  from  the  matrix.  Subsequently,  with  reasonable  exposure  time  of 
adsorption  to  the  Cu2+-matrix,  [H6Lsma]7  eluted  out  as  a  metal-bound  form  in  high  ionic 
strength  buffer.  Taking  into  account  the  three-dimensional  scaffolding  by  the  folded  protein, 
the  rings  conform  to  a  closed-structure  yielding  to  [HeLsma]?  rings  surrounding  a  cluster  of 
Cu2+  ions. 


Figure  5.13\  Single-ring  form  of  / Hffsma] 7  shown  by  (A)  Analytical  SEC;  the  dashed  line 
indicates  the  elution  volume  of  a  single  ring  state  and  (B)  TEM  of  [H^Lsma]  7  as  single  rings 
post-EDTA  treatment. 
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5.9  Metal  conjugation  of  [H6Lsma]7 

Selective  metals  were  chosen  depending  on  their  affinity  to  His  residues  and  incubated  with 
[H6Lsma]7.  Analytical  SEC  was  used  to  analyse  any  change  in  oligomeric  state  of  the 
complex  formed.  As  discussed  in  the  previous  Section  5.3,  transition  metals  such  as  Ni2+, 
Zn2+,  and  Co2+  have  high  specific  affinity  to  His  groups  exposed  on  the  protein  surface.. 
Addition  of  a  molar  excess  of  nickel  ions  to  the  [H(lLsma]7  sample  shifted  the  elution  peak 
towards  a  higher-order  molecular  weight  species  (Figure  5.14B).  This  fraction  when  isolated 
and  treated  with  a  competing  chelating  agent,  shifts  back  to  a  -400  kDa  form  (Figure  5.14C). 
This  indicates  a  reversible  metal-binding  capacity  of  affinity-purified  [H6Lsma]7  complex. 
Thus,  there  is  formation  of  a  bi-metallic  [H6Lsma]7  complex,  that  binds  to  Cu2+  ions  from  the 
affinity-chromatography  and  Ni2+  ions  when  added  externally.  The  most  likely  explanation  of 
this  occurrence  is  the  high  specific  affinity  of  Ni2+  to  the  imidazole  groups  exposed  on  the 
surface  of  the  protein,  which  leads  to  multimerisation  of  the  rings  onto  the  affinity-purified 
[H6Lsma]7.  Similar  experiments  carried  out  with  Zn2+  and  Co2+  ions,  did  not  cause  any 
change  in  multimerisation  and  higher-order  oligomerisation  of  [H6Lsma]7. 

Metal  ions  were  added  to  un-tagged  Lsma  to  assess  its  metal  tolerance.  Equimolar  ratio  of 
metal  to  protein  invariably  resulted  in  white  precipitate  indicating  that  the  His6-tag  moiety  is 
required  for  metal  binding  and  increases  the  robustness  of  the  Lsma  complex  formed. 
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B 


[H(,Lsma]7 


[H6Lsma]7  +  Ni 


2+ 


[H6Lsma]7  +  Ni2++  EDTA 


Figure  5.741  SEC  traces  showing  solution  behaviour  of  [H^Lsma]  7  on  addition  of  Ni2+ (0.2 
mM).  The  dotted  lines  represent  elution  position  of  [H^Lsma] 7  complex  and  single-ring  form. 
The  running  buffer  was  20  mM  Tris,  pH  8.0,  150  mM  NaCl,  2%  glycerol.  Sizing  column  used 
was  a  Superdex  200  10/300  column  run  at  a  flow  rate  of  0.4  mL/min. 
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5.10  Small-angle  X-ray  scattering  (SAXS) 

In  previous  solution  state  characterisation  of  Lsm  proteins  at  the  Macquarie  Protein  Structural 
Group,  none  of  the  prepared  Lsm  proteins  have  shown  discrete  higher  order  assemblies. 
Although  stacking  has  been  inferred  by  SEC  and  seen  in  crystal  packing,  both  axial  and 
staggered,  in  different  cases  of  Lsm  proteins,  including  Lsma,  clusters  have  not  been 
previously  observed.  Therefore,  SAXS  was  used  to  pursue  structural  characterisation  of  the 
[PLLsma]?  construct. 

Samples  of  [PLLsma]?,  were  analysed  using  the  SAXS  beamline  at  the  Australian 
Synchrotron  34 .  The  SAXS  beamline  has  a  protein  specific  arrangement  with  an  in-line  SEC 
system  to  fractionate  protein  preparations  into  their  individual  components.  A  Superdex  200 
5/150  GL  gel  filtration  column  was  placed  in-line  with  the  SAXS  beamline.  This  arrangement 
ensured  monodispersity  and  buffer  matching  which  are  crucial  for  obtaining  quality  SAXS 
data  for  proteins  35 .  Data  were  continuously  collected  as  the  protein  eluted  from  the  column. 
Scattering  patterns  for  [H6Lsma]7  were  radially  averaged  to  derive  one-dimensional 
scattering  curves  and  scattering  contributions  of  the  buffer  were  subtracted  from  the 
presented  curves.  This  data  reduction  was  carried  out  using  ScatterBrain  software  developed 
at  the  Australian  Synchrotron  34. 

To  assess  data  quality  and  provide  a  more  precise  definition  of  molecular  dimensions, 
Guinier  plots  were  calculated.  The  fact  that  the  plots  were  linear  and  the  scattering  data  did 
not  show  any  upward  curvature  at  low  angles,  allows  us  to  eliminate  the  possibility  of 
aggregate  formation  (Figure  5.15B).  The  Guinier  plots  provide  data  quality  assurance  for 

o/: 

further  processing  and  shape  determination.  The  slope  of  Guinier  plots  yield  radius  of 
gyration,  Rg  values  in  reciprocal  space.  For  [FLLsma]?  the  RgKC  was  calculated  as  5.1+0.05 
nm. 
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A  B 


Figure  5.15 1  SAXS  data  recorded  at  SAXSAVAXS  beamline  at  Australian  Synchrotron.  The  following  parameters  were  calculated  for  [H^LsmaJy 
loaded  onto  the  SEC  column  at  2  mg/mL  (A)  Scattering  curve.  (B)  Guinier  approximation.  (C)  Distance  distribution  function.  (D)  Kratky 
representation  of  scattering  data. 
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Fourier  transformation  of  collected  data  produced  smooth  bell-shaped  electron-pair 
distribution  function  (P(rj)  for  this  complex  (Figure  5.15C).  The  real  space  radius  of 
gyration  ( Rg  real)  was  5.16+0.1  nm  and  the  maximum  particle  dimension,  Dmax  was  16.3  nm 
calculated  from  Autoporod  37 .  The  real  space  radius  of  gyration  is  in  good  agreement  with 
reciprocal  space  data  derived  from  the  Guinier  fit.  In  order  to  determine  that  [H6Lsma]7  exists 
in  a  folded  globular  state,  a  Kratky  plot  was  derived  (Figure  5.15D).  The  sample  yielded  a 
sharp  clear  peak  at  low  values  of  q ,  with  a  plateau  at  higher  values.  This  trend  is  indicative  of 
a  folded  globular  protein. 

SAXS  data  can  be  used  to  derive  three-dimensional  molecular  envelopes  38.  This  provides  a 
detailed  picture  regarding  the  structure  of  the  protein  being  studied.  Ab  initio  reconstruction 
of  [H6Lsma]7  using  GASBOR  35  yielded  an  obtuse  ellipsoid  geometry.  To  validate  the 
solution  structure  of  [H(,Lsma]7  obtained,  the  coordinate  file,  PDB  1181,  was  used  to  calculate 
a  theoretical  X-ray  scattering  curve.  The  resulting  curve  was  compared  to  the  experimental 
scattering  curve.  Both  visual  inspection  and  the  Chi2  value  of  5.1  suggested  that  it  is  a  very 
poor  fit  (Figure  5.16)  confirming  that  the  [H(,Lsma]7  complex  exists  as  a  higher-order 
organisation  of  individual  Lsma  rings. 


Figure  5.16\  Overlay  of  experimental  scattering  by  [H^smajy  (blue)  and  theoretical 
scattering  of  Lsma  (PDB  ID  1181)  (red)  using  CRYSOL  39 . 
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In  order  to  check  whether  the  oligomerisation  of  [H6Lsma]7  was  concentration  dependent, 
different  sections  of  the  eluted  peak  as  the  sample  eluted  from  the  SEC  column  were 
analysed.  The  frames  were  chosen  and  analysed  separately  for  the  fractions.  Table  5.2  lists 
the  parameters  collected  for  different  fractions  of  [H6Lsma]7.  The  molecular  weight  was 
calculated  from  the  Porod  volume  using  Autoporod  37.  The  data  suggest  that  the 
oligomerisation  of  [H6Lsma]7  is  similar  at  different  sections  of  the  eluted  peak,  thus 
indicating  concentration  independent  oligomerisation.  Figure  5.17  shows  the  overlaid 
scattering  curves  of  the  frames  collected. 

[H6Lsma]7  has  a  molecular  weight  distribution  of  400-500  kDa  as  determined  by  SEC-SLS 
(Section  5.6).  This  indicates  a  modular  combination  of  five  to  seven  Lsma  rings.  Pymol  was 
used  to  generate  combinations  of  rings  using  symmetry  functions  and  PDB  files  were 
generated.  Theoretical  curves  from  these  PDB  files  were  then  compared  to  the  experimental 
scattering  curve  of  [H6Lsma]7  to  assess  the  arrangement  of  rings  in  the  complex  (Figure 
5.18). 


Figure  5.17 1  Overlaid  non-scaled  scattering  curves  of  [H^smajy  calculated  from  different 
regions  of  the  eluted  proteins  at  different  concentrations. 
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Table  5.21  Geometric  parameters  collected  for  different  fractions  of  [HeLsma^ . 

Number 

of 

frames 

Dmax  (nm) 

Rg rec  (nm) a 

Rg real  (nm) b 

Volume  a 
(nm3) 

Mwa  (+0. 1 
kDa) 

Number 
of  rings 

7 

16.1 

5.1+0.05 

5. 1+0.1 

920 

567 

7.3 

10 

16.3 

5.1+0.05 

5. 1+0.1 

908 

556 

7.2 

10 

16.0 

5.1+0.05 

5. 1+0.1 

900 

554 

7.2 

3  Calculated  from  Autoporod  32 . 
b  Calculated  from  GNOM  33. 

RSKC,  reciprocal  space  radius  of  gyration  derived  from  Guinier  approximation;  /?greal,  real  space  radius 
of  gyration  from  distance  distribution  function;  Dmax,  maximum  particle  diameter. 

All  data  were  obtained  in  PBS,  pH  8.0. 


Visual  inspection  confirms  poor  agreement  (Figure  5.19B(ii))  although  it  is  better  than  the 
comparison  between  PDB  1181  and  the  experimental  data.  It  can  therefore,  be  concluded  that 
[H6Lsma]7  does  not  contain  a  stacked  arrangement  but  maybe  an  angular  formation  of  rings. 
This  is  also  indicative  of  a  hollow  cavity  re-enforcing  the  notion  that  this  complex  has  a  cage¬ 
like  architecture  as  seen  by  TEM. 


5-mer  6-mer  7-mer 

Figure  5.18 1  Different  combinations  of  rings  obtained  from  the  crystal  lattice  of  Lsma  (PDB 
ID  1181 )  used  to  fit  into  ab  initio  model  of  [ Hffsma]  7.  Structures  were  generated  in  Pymol 40 . 
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(A(ii))  Overlay  of  the  ab  initio  model  obtained  (pink)  and  the 


experimental  data  (blue).  (B(i))  Superimposition  of  six  rings  on  the  GASBOR  model  of  [H^LsmaJ  7  using  SUP  COMB  41 .  ( B(ii ))  Fits  of  different 


combination  of  rings  on  the  scattering  curve  of  the  ab  initio  model. 
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5.11  Summary 

The  manipulation  of  the  quaternary  structure  of  His6-tagged  Lsma  into  higher-order 
hierarchical  organisations  was  achieved  by  placement  of  His6-segments  at  one  end  of  the 
sequence  of  the  self-assembling  system.  The  solution  behaviour  of  Lsma  is  influenced  by  the 
position  of  the  tag  at  either  the  N-  or  C-terminus  of  the  protein  sequence,  with  the  N-terminal 
[H6Lsma]7  complex  exhibiting  a  more  robust  and  stable  solution  behaviour  and  assembly  into 
a  discrete  higher  order  organisation. 

The  [FLLsmajv  assembly  was  shown  to  be  Cu2+-bound.  Such  behaviour  has  been  observed  in 
the  case  of  other  proteins  isolated  by  affinity  chromatography  but  [H(lLsma]7,  being  ring- 
shaped,  organises  into  regular  cage-like  structures.  Such  a  desirable  architecture  for 
generation  of  biological  nanostructures  is  often  lacking  in  globular  proteins.  The  [HflLsina]7 
metal-bound  assembly  is  consistent  with: 

i.  Preferential  multi-point  interactions  with  high  binding  strength  to  copper  ions  via 
coordination  bonds  between  the  His  moiety  . 

ii.  Selective  binding  of  the  metal  ion  to  the  exposed  and  available  deprotonated  amines 
on  the  protein  surface  9 . 

Both  mechanisms  provide  an  explanation  for  the  formation  of  a  thermodynamically  stable 
metal-bound  [H6Lsma]7  assembly  resulting  in  a  tightly  bound  closed-architecture.  Such  an 
organisation  has  not  been  observed  for  other  Lsm  protein  systems.  The  types  of  interactions 
present  in  other  Lsm  proteins  are: 

i.  Face-to-face  stacking  present  in  the  case  of  Lsm3  of  two  octameric  rings  in  the  crystal 
structure  mediated  via  the  N-termini,  involving  salt  bridge  formation  and  His 
interactions  18. 
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ii.  Head-to-tail  stacking  observed  in  Lsma  by  electrostatic  contributions  of  acidic 
residues  present  on  the  loop  face  and  exposed  basic  and  polar  residues  on  the  a-helix 
face  42  43 ;  44 . 

In  contrast,  the  interactions  involved  in  the  [H6Lsma]7  complex  are  mainly  due  to  metal- 
coordination  bonds  and  have  been  shown  not  to  be  electrostatic  in  nature.  Although,  pH  plays 
a  role  in  the  stabilisation  of  the  complex,  conditions  were  optimised  to  switch  it  to  a  single¬ 
ring  form.  With  the  tag  on  the  surface  on  the  ring  scaffold,  it  was  possible  to  obtain  ordered 
multimers  of  the  modified  ring  through  metals.  The  tags  form  reversible  yet  stable  metal- 
chelate  complexes  and  confer  upon  the  Lsma  ring  scaffold  the  ability  to  form  supramolecular 
architectures. 
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6  Chapter  six 

Discussion  and  suggestions  for  future  work 

6.1  Introduction 

Self-assembling  proteins  have  been  recognised  in  the  literature  as  being  among  a  number  of 
biomolecules  and  biomolecular  assemblies  that  have  the  potential  to  be  used  as  nanomaterials 
1;  2;  3.  The  aims  of  this  project  were  to  explore  the  utility  of  the  self-assembling  Lsma  system 
and  to  design  and  characterise  the  various  nanostructures  that  could  be  adopted  by  the  ring- 
shaped  protein.  A  number  of  other  protein  systems  have  been  identified  and  explored  for 
incorporation  into  nanodevices  4’  5  by  modifying  the  protein  interface  to  generate  suitable 
structures.  This  approach  provides  an  opportunity  to  design  the  proteins  to  assemble  in  a 
precise  manner  in  three  dimensional  space  ’  .  In  order  to  exploit  a  bottom-up  approach  for 
the  fabrication  of  molecular  biomaterials,  it  has  become  important  to  better  understand  and 
control  the  process  of  self-assembly  of  protein  units  into  their  supramolecular  organisations. 
Therefore,  the  study  of  more  protein  tectons  with  suitable  properties  is  warranted. 

Molecular  building  blocks  should  possess  properties  which  make  them  suitable  for 
downstream  applications  and  provide  better  control  over  their  assembly.  First  and  foremost, 
the  system  should  exhibit  an  inherent  ability  to  self-assemble  into  clearly  organised  forms 
and  combinations.  This  feature  provides  a  basis  from  which  to  develop  general  principles  for 
assembly  of  higher  order  structures  from  simple  protein  components.  A  naturally  self¬ 
assembling  system  provides  opportunities  to  engineer  scaleable  structures  from  the  protein 
componentary. 
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Secondly,  the  tectons  should  possess  adequate  functionality  that  can  be  explored  for  potential 
applications.  The  chemistry  of  protein  modules  allows  metals  to  be  readily  introduced  or 
deposited  on  the  surface  or  cavities  of  proteins  depending  on  the  purpose.  Such  methods  of 
using  proteins  as  metal-depositing  templates  have  been  explored  with  a  variety  of  metals  8;  9; 
10.  Certain  protein  parts  have  affinity  towards  other  biomolecules  such  as  nucleotides,  and 
hence  can  act  as  capturing-units  when  used  for  fabrication  of  biosensors  or  drug  delivery 
vehicles.  Proteins  also  provide  attachment  points  for  embellishment  or  molecular 
modification,  thereby  providing  unique  functionality  to  the  supramolecular  structures. 

The  third  requirement  is  the  stability  of  the  protein  assembled  structure,  which  increases  the 
viability  of  downstream  applications.  Availability  of  thermostable  versions  of  protein  tectons 
is  an  added  advantage  for  fabrication  purposes.  Additional  properties,  such  as  easy  expression 
and  purification  of  the  proteins  and  its  variants,  are  required  to  explore  a  molecular  building 
block  for  its  applications  in  bionanotechnology. 

Lsm  proteins  and  their  variants  were  identified  as  meeting  these  requirements.  They 
constitute  a  large  indispensible  protein  family  of  RNA  chaperones  in  all  domains  of  life  n. 
Lsm  proteins  from  diverse  organisms  have  been  found  in  ring  morphologies  with  diameters 
ranging  from  5.5  -  7.5  nm,  with  a  central  cavity  of  0.5  -  1.5  nm  12 .  Regardless  of  the  amino 
acid  sequence,  Lsm  proteins  show  the  propensity  to  assemble  into  ring-shaped  quaternary 
structures,  with  five  to  eight  monomers  forming  the  toroid  complex.  This  feature  made  Lsma 
an  attractive  target  to  explore  as  a  molecular  tecton. 
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All  Lsm  proteins  possess  RNA  binding  motifs  making  them  suitable  as  capturing  devices. 
Thermostable  versions  of  Lsm  proteins  are  also  available  which  exhibit  protein  stability  up  to 
90  °C.  Although  Lsm  proteins  have  been  studied  extensively  for  their  role  in  RNA 
processing,  investigating  the  assembling  properties  for  its  role  in  bionanotechnology  is 
addressed  here  for  the  first  time. 

This  thesis  aimed  to  investigate  the  properties  and  adaptability  of  Lsm  proteins  suitable  for 
applications  in  bionanotechnology.  A  bottom-up  approach  was  used  to  fabricate  molecular 
structures  from  the  self-assembling  thermostable  Lsma  system.  The  native  Lsma  assemblies 
were  modified  by  site-directed  mutagenesis  to  generate  oligomers  of  different  pore  sizes, 
introduce  novel  functionality  and  generate  supramolecular  assemblies  of  interest.  The 
structures  were  determined  through  biophysical  techniques  of  SEC-SLS,  SAXS  and  finally 
TEM.  The  scheme  of  various  approaches  utilised  to  change  the  assembly  and  supramolecular 
organisation  of  Lsm  proteins  is  shown  in  Figure  6.1. 
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Figure  6.1 1  Schematic  diagram  of  multimer  formation  of  Lsma  proteins  generated  by 
targeting  two  regions  of  the  ring  scaffold,  (i)  Interface  modification  (ii)  Surface  modification. 
Six  possible  outcomes  can  be  achieved  using  Lsma  as  a  building  block,  out  of  which  five  have 
been  explored  in  this  thesis. 
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6.2  Control  of  the  pore  size 

Initially,  two  sources  of  Lsm  proteins  were  chosen,  namely  the  eukaryotic  Lsm3  (5. 
cerevisiae )  13  polypeptide  that  assembled  as  an  octamer,  and  the  archaeal  Lsma  (M 
thermoautotrophicum)  which  formed  a  heptameric  ring.  After  initial  expression  studies, 
Lsma  easily  expressed  as  a  soluble,  highly  stable  protein,  and  was  therefore  chosen  as  the 
model  protein  for  this  thesis. 

The  Lsm  protein  complexes  assemble  into  cyclic  oligomers  and  exhibit  a  high  degree  of 
oligomeric  plasticity  assembling  into  different  states  ranging  from  a  trimeric  to  an  octameric 
form  in  different  species  when  generated  in  vitro.  Despite  their  conservation  at  the  amino 
acid  sequence  level  and  similar  3D  folds,  Lsm  proteins  are  commonly  found  in  hexameric 
and  heptameric  states  and  less  frequently  as  a  pentamer  and  octamer.  It  was  hypothesised  that 
the  quaternary  structure  could  be  altered  by  judicious  changes  at  the  inter-subunit  contacts 
and  hence,  cyclic  oligomers  of  different  pore  sizes  obtained. 

Probing  the  inter-subunit  interactions  is  important  to  the  understanding  of  self-assembly  of 
circular  protein  oligomers.  A  strategy  employed  by  Chen  et  al.  to  change  the  oligomeric  state 
of  TRAP  protein  from  B.  stearothermophilus  and  B.  halodurans  was  to  (i)  delete  the  C- 
terminal  region  and  (ii)  insert  a  bulky  hydrophobic  residue.  In  both  cases,  it  caused  rotation 
of  the  subunits  leading  to  the  formation  of  a  12-mer  from  an  11-mer  14 .  This  strategy  was 
successful  because  key  hydrogen  bonds  were  disrupted,  allowing  for  the  formation  of  a  12- 
mer  circular  oligomer. 

Lsm  protein  interface  presents  a  complex  network  of  hydrogen  bonds  and  hydrophobic 
interactions,  hence  a  different  approach  was  employed  to  investigate  the  subunit  interface. 
The  protomeric  interface  of  Lsma  was  studied  by  alanine  screening.  As  observed  by  SAXS 
and  SEC-SLS,  there  was  no  apparent  change  in  the  oligomeric  state  of  most  of  the  muteins. 
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The  error  inherent  in  these  techniques  made  it  difficult  to  distinguish  between  a  heptamer  and 
a  hexamer.  For  all  the  samples  tested,  none  of  them  showed  molecular  weights  greater  than 
that  of  a  heptamer,  therefore,  octameric  and  higher  order  oligomeric  states  can  be  ruled  out. 
The  mutein,  R65P,  consistently  showed  results  corresponding  to  the  molecular  weight  of  a 
hexamer  and  it  would  be  interesting  to  verify  the  quaternary  structure  of  this  mutein  by  X-ray 
crystallography  or  native  mass  spectrometry. 

Two  muteins,  L70A  and  I71A,  exhibited  complete  breakdown  into  the  monomeric  form  as 
assessed  by  SDS-PAGE,  indicating  that  residues  L70  and  171  are  important  in  the  stability  of 
the  cyclic  oligomer.  For  L70A,  a  monomeric  form  was  observed  by  preparative  SEC.  It 
would  be  interesting  to  optimise  the  conditions  to  isolate  this  monomeric  form  in  solution  and 
characterise  it  further.  The  171 A  mutein  did  not  show  presence  of  any  monomeric  form  in 
solution  but  dissociated  completely  on  the  SDS-PAGE  gel. 

In  addition  to  changing  single  residues  at  the  interface,  chimeric  muteins  were  also  generated 
to  alter  the  oligomeric  assembly.  However,  this  change  in  the  entire  interfacing  segments, 
caused  the  protein  to  dissociate  into  monomers  as  assessed  by  SDS-PAGE.  It  most  likely 
disrupted  the  protomeric  interface,  which  resulted  in  failure  of  ring  formation.  This  indicates 
that  the  associations  are  conserved  within  species  and  also  between  each  of  the  protomers. 
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6.3  Generation  of  nanotubes 

Previous  studies  on  synthetically  constructed  Lsm  proteins  have  shown  electrostatically- 
mediated  multimerisation  events  which  were  thought  to  occur  via  stacking  of  the  rings: 

(i)  Helix-to-helix  face  stacking  of  two  Lsm  protein  rings  has  been  observed  in  the  crystal 
structure  of  Lsm3  13.  Packing  interactions  include  a  salt  bridge  (Glu-2  and  Lys-9),  and 
contacts  between  the  imidazole  rings  of  His-5  and  His-6  of  the  N-terminal  His6-tag. 

(ii)  Helix -to-loop  face  interactions  have  been  previously  inferred  in  archaeal  Lsm  protein 
complexes.  Lsma  has  been  found  to  produce  fibril  structures  in  low  ionic  strength  conditions. 
The  interactions  are  hypothesised  to  take  place  via  electrostatic  contributions  of  acidic 
residues  in  loop4  (EDGE)  and  exposed  charged  and  polar  residues  in  the  flexible  N-terminus 
a-helix. 

(iii)  Loop-to-loop  face  intermolecular  stacking  has  been  observed  in  the  crystal  structure  of 
yeast  SmF  15  utilising  loop4  residues. 

The  aim  of  this  research  work  was  to  produce  a  new  level  of  control  over  the  stacking  of 
Lsma  rings.  The  interactions  that  engage  the  same  binding  interfaces  of  two  rings  are 
“symmetrical”  in  nature  and  self-limiting.  Therefore  these  interactions  cannot  provide  for 
regular  multi-ring  polymer  formation  i.e.  extension  via  stacking  into  tubes.  However,  an 
“asymmetrical”  interaction  i.e.  helix-to-loop  face  utilises  two  distinct  binding  interfaces.  This 
type  of  interaction  could  lead  to  polymerisation  and  subsequent  formation  of  high-aspect  ratio 
structures.  Furthermore,  should  such  a  polymer  engage  in  both  “symmetrical”  and 
“asymmetrical”  interactions  it  is  possible  to  obtain  large  multimers  through  repeated 
combinations. 
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An  important  outcome  of  solution  studies  of  polyproteins  conducted  by  Jens  Molls 
demonstrated  that  synthetic  constructs  of  Lsm  proteins  exhibited  multimersation  events  with 
possible  stacking  16.  The  existing  propensity  of  Lsma  to  form  fibril-like  polymers  and  the 
stacking  observed  in  the  crystal  lattice  (PDB  ID  1181),  led  to  the  design  of  engineered 
disulfide  bridges  to  covalently  stabilise  the  stacking  events  in  solution  in  this  work.  The 
results  confirmed  this  hypothesis,  demonstrating  that  engineered  Lsma  heptameric  complexes 
successfully  assembled  into  supramolecular  structures  thus  creating  a  reversible  nanotube  of 
relatively  narrow  length  distribution.  It  provided  a  new  level  of  hierarchical  control  over  the 
assembly  of  Lsma  rings  which  had  not  been  observed  before. 

Organisation  into  higher-order  structures  of  N10C/E61C  was  confirmed  by  TEM  which 
showed  the  arrangement  of  rings  into  tubes.  Solution  based  analysis  of  NIOC  by  SEC  was 
consistent  with  association  into  a  dimer  of  rings.  The  mutein  NIOC  showed  a  “symmetrical” 
interaction  via  helix-to-helix  face  covalent  association.  This  could  be  attributed  to  the  highly 
exposed  engineered  Cys  residues  on  the  flexible  N-terminal  region.  E61C  did  not  associate  at 
the  loop-to-loop  face,  perhaps  due  to  the  very  short  loop4  of  Lsma  and  the  potential 
constraint  provided  by  the  other  structural  elements  in  the  vicinity  of  loop4.  Further 
polymerisation  with  the  double  Cys  mutein  N10C/E61C  was  achieved  by  incubation  of 
N10C/E61C  with  E61C  which  lead  to  the  polymerisation  of  the  rings  in  a  discrete  step-wise 
manner.  Thus,  the  Cys  muteins  have  provided  three  separate  types  of  molecular  tectons  -  (i)  a 
single  ring  with  reactive  Cys  groups  (E61C),  (ii)  a  dimer  of  rings  (NIOC)  and  (iii)  an 
extended  tube  form  (N10C/E61C)  as  shown  in  Figure  6.1. 
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6.4  Effect  of  the  His6-tag  on  the  supramolecular  assembly 

Lsma  was  developed  as  a  construct  with  an  His6-tag.  Its  oligomerisation  was  found  to  be 
dependent  on  the  placement  of  the  tag  at  either  the  N-  or  C-terminus.  Placement  of  the  His- 
tag  at  the  N-terminus  caused  a  marked  change  in  the  supramolecular  organisation  of  Lsma 
due  to  chelation  of  copper  ions.  The  oligomerisation  was  found  to  be  metal-dependent  and 
large  quantities,  40  mg  per  litre  of  culture,  of  highly  pure  and  stable  multimeric  species  of 
Lsma  were  isolated.  Such  nanostructures  were  formed  only  when  exposed  to  certain  metal 
ions  indicating  high  metal  specificity. 

The  position  of  the  His6-tag  played  a  role  in  the  supramolecular  assembly  of  the  Lsma.  The 
P5-strand  extends  into  the  C-terminus  which  then  protrudes  laterally  away  from  the  heptamer 
core  at  the  equatorial  plane  of  the  ring.  Both  solution  studies  and  TEM  revealed  that 
[LsmaH6]7  differed  from  the  properties  of  [H(lLsma]7.  [LsmaH6J7  exhibited  a  heterogeneous 
mixture,  making  it  unsuitable  for  further  studies.  It  could  be  possible  to  obtain  metal-assisted 
two  dimensional  arrays  as  the  His6-tag  is  exposed  laterally  from  the  ring  structure.  It  would 
be  interesting  to  optimise  conditions  for  [LsmaHg]?  to  obtain  a  stable  homogenous  sample. 

Placing  the  tag  at  the  N-terminus,  completely  changed  the  assembly  of  Lsma.  [H6Lstna]7 
assembled  as  cage-like  structures  at  alkaline  pH.  This  was  assessed  by  SEC  and  confirmed  by 
TEM.  The  3D  model  from  the  SAXS  data  corresponded  to  the  dimensions  presented  by 
TEM.  It  was  also  shown  that  the  assembly  is  bound  to  Cu2+  and  further  can  bind  to  Ni2+  in  a 
reversible  manner.  This  is  an  important  property  of  a  molecular  building  block,  conferring  it 
with  a  metal-binding  functionality. 
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From  the  standpoint  of  using  metal  to  induce  assembly  of  proteins,  Huard  et  al.  engineered 
ferritin  protein  to  oligomerise  in  presence  of  the  Cu2+,  generating  a  cage-like  structure  17 .  In 
terms  of  changing  the  assembling  properties,  TMV  exhibits  remarkable  changes  and 
assembling  properties  with  the  attachment  of  a  His-tag,  with  change  in  pH  and  the  position  of 
the  tag  1S.  The  modified  TMV  coated  protein  (CP)  was  also  used  as  a  one  dimensional 
template  for  Au  NPs  synthesis  19 .  Recently,  DNA  modified  with  His6-tag  and  NTA,  was 
assembled  into  arrays  with  addition  of  Ni2+  through  formation  of  metal-chelate  complex  20. 
The  incorporation  of  the  His6-tag  into  the  Lsma  scaffold  drives  a  metal-mediated  assembly  of 
the  protein  which  exhibited  different  levels  of  supramolecular  assembly  with  variation  in  the 
metal  concentration.  Thus,  Lsma  was  engineered  to  show  metal-binding  properties,  in 
addition  to  assembling  as  stacks  in  a  controlled  manner. 

6.5  Future  perspectives  and  outlook 

Solution  and  electron  microscopy  studies  demonstrated  that  Lsma  rings  can  form  stable 
supramolecular  structures  with  both  “closed”  and  “extended”  conformations  (Figure  6.1). 
The  Lsma  proteins  in  this  research  retain  their  RNA  binding  capacity  and  therefore,  the  next 
step  would  be  to  study  the  RNA  binding  ability  of  the  structures  generated.  This  feature  is 
advantageous  when  such  a  tecton  is  used  as  a  component  for  a  biosensor  to  capture 
nucleotides.  This  motif  could  further  be  modified  with  metal-binding  sites  or  serve  as 
attachment  points  to  an  inorganic  surface  21 .  Furthermore,  within  the  Lsm  structure,  each 
monomer  is  folded  such  that  the  unstructured  N-  and  C-terminal  tails  are  naturally  exposed. 
These  segments  can  be  derivitised  with  antimicrobial  peptides  or  even  conjugated  to 
industrially  useful  enzymes  5,  making  Lsm  a  useful  nanoscaffold.  Lsma  derivitised  with  the 
His6-tag  could  be  assembled  on  a  NTA-modified  surface  to  obtain  an  array  of  protein. 
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It  would  be  of  interest  to  apply  the  metal-binding  capability  to  the  nanostructures  generated 
and  to  fabricate  a  biosensor  with  RNA-binding  capacity.  The  nanotubes  can  be  engineered  to 
bind  metal  which  could  lead  to  the  formation  of  metal  wires  through  a  bottom-up  approach, 
with  its  dimensions  confined  to  the  nanometer  range.  The  strategies  employed  to  engineer 
supramolecular  structures  for  Lsma  can  be  extended  to  other  Lsm  systems  including  Lsm 
polyproteins.  Peroxiredoxin  protein  has  been  recently  used  to  capture  Au  NPs  into  one 
dimensional  space  .  Similar,  approaches  have  been  employed  for  TRAP  protein  and  SP1 
protein  24,  wherein  the  doughnut-shaped  structures  act  as  NP  assembling  templates.  Lsm 
proteins  offer  smaller  pore  dimensions  (0.5  -  1.5  nm)  compared  to  other  protein  systems, 
which  is  likely  to  be  advantageous  for  applications  where  small  and  uniform  NPss  are 
required  25;  26 . 

Completely  synthetic  versions  of  Lsm  proteins,  namely  Lsm  polyproteins,  are  being  routinely 
generated  by  Protein  Structure  Group,  Macquarie  University.  Lsm  polyproteins  feature 
individual  Lsm  proteins  covalently  linked  via  unstructured  sequence  extensions  and  allow  for 
simultaneous  expression  of  two  Lsm  proteins  as  heteroheptamers.  The  knowledge  gained 
from  the  Lsma  system  can  be  further  extended  to  Lsm  polyproteins  for  tailor-made 
properties. 

As  depicted  in  the  schematic  (Figure  6.1),  four  possible  strategies  were  explored  in  the  given 
time  frame,  out  of  which  three  gave  fruitful  results.  It  would  be  of  interest  to  generate  arrays 
and  fibrils  in  a  controlled  manner,  for  applications  in  bionanotechnology.  Such  protein 
tectons  can  be  employed  to  control  the  assembly  of  nanomaterials  and  help  generate  metallic, 
inorganic,  or  organic  materials  with  precise  control  over  their  dimensions.  For  metallic 
materials,  the  spherical,  rod-like,  and  planar  architectures  of  proteins  have  found  success  in 
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generating  predesigned  NPs  and  nanowires  27: 2S.  Other  avenues  where  such  building  blocks 
can  be  potentially  applied  are  drug  delivery,  biosensing  and  polymer-composites. 

In  this  study,  insights  were  gained  into  the  controlled  hierarchical  assembly  of  Lsma  proteins 
and  their  potential  use  as  a  molecular  tecton.  Hence,  the  results  obtained  from  Lsma 
complexes  in  this  study  aid  our  understanding  of  self-assembly  and  contribute  to  the  growing 
field  of  molecular  building  blocks. 
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